1 Introduction



The addressed problem and the results. Let $\Sigma_N$ be an $M \times N$ complex-valued matrix defined by

$$\Sigma_N = B_N + \sigma W_N \tag{1}$$

where $B_N$ is an $M \times N$ deterministic matrix such that $\sup_N \|B_N\| < +\infty$, and where $W_N = ([W_N]_{i,j})$ is an $M \times N$ complex Gaussian random matrix with independent identically distributed (i.i.d.) entries such that $E[[W_N]_{i,j}] = 0$, $E|[W_N]_{i,j}|^2 = \frac{1}{N}$, and $\mathrm{Re}([W_N]_{i,j})$ and $\mathrm{Im}([W_N]_{i,j})$ are i.i.d. zero mean real Gaussian random variables. Model (1) is referred to in the literature as the information plus noise model (see e.g. Dozier-Silverstein [14]). In this paper, we assume that $\mathrm{Rank}(B_N) = K(N) = K < M$ because this assumption is verified in a number of practical situations, in particular in the context of the spiked models addressed here.

The purpose of this paper is to study the almost sure location of the eigenvalues $\hat\lambda_{1,N} \geq \ldots \geq \hat\lambda_{M,N}$ of the Gram matrix $\Sigma_N \Sigma_N^*$ when $M$ and $N$ converge to $+\infty$ such that the ratio $c_N = \frac{M}{N}$ converges towards a constant $c > 0$, and to take benefit of the results to obtain, using a different approach than Benaych-Nadakuditi [7], the behaviour of the largest eigenvalues of the information plus noise spiked models for which the rank $K$ of $B_N$ remains constant when $M$ and $N$ increase to $+\infty$.

The empirical spectral measure (or eigenvalue distribution) $\hat\mu_N = \frac{1}{M}\sum_{k=1}^{M} \delta_{\hat\lambda_{k,N}}$ of matrix $\Sigma_N\Sigma_N^*$ has the same asymptotic behaviour as a deterministic probability distribution $\mu_N$ (see e.g. Dozier-Silverstein [14, Th.1.1] or Girko [16, Th.7.4]) whose support $\mathcal{S}_N$ is the union of disjoint compact intervals, called in the following the clusters of $\mathcal{S}_N$. The boundary points of each cluster coincide with the positive extrema of a certain rational function depending on the empirical spectral measure of matrix $B_N B_N^*$, on $\sigma^2$ and on the ratio $c_N = \frac{M}{N}$ (see [28, Theorem 2]). Each cluster of $\mathcal{S}_N$ appears to be naturally associated to another interval containing a group of consecutive eigenvalues of $B_N B_N^*$ ([28]). It is shown in [28] that the property proved in Bai-Silverstein [2] holds in the context of model (1). Roughly speaking, it means that if an interval $[a, b]$ is located outside $\mathcal{S}_N$ for $N$ large enough, then almost surely no eigenvalue of $\Sigma_N\Sigma_N^*$ belongs to $[a, b]$ for all large $N$.

In this paper, we establish the analog of the property called in Bai-Silverstein [3] "exact separation": almost surely, for $N$ large enough, the number of eigenvalues of $\Sigma_N\Sigma_N^*$ less than $a$ (resp. greater than $b$) coincides with the number of eigenvalues of $B_N B_N^*$ associated to the clusters included into $[0, a]$ (resp. included into $[b, \infty)$). Note that these results also hold in the case where $K = M$, not treated in this paper. Indeed, the analysis of the support $\mathcal{S}_N$ provided in [28] can be extended when $B_N B_N^*$ is full rank. Once the characterization of the support is established, the probabilistic part of the proof of the above mentioned exact separation result can be used verbatim.

We also use the separation result to study the case where $\mathrm{Rank}(B_N) = K$ is independent of $N$. It is assumed that for each $k = 1, \ldots, K$, the non zero eigenvalues $(\lambda_{k,N})$ of $B_N B_N^*$ satisfy $\lim_{N \to +\infty} \lambda_{k,N} = \lambda_k$. The support of $\mu_N$ is first characterized in this case, and using the above results related to the almost sure location of the $(\hat\lambda_{k,N})_{k=1,\ldots,M}$, it is proved that if $\lambda_k > \sigma^2\sqrt{c}$, then

$$\hat\lambda_{k,N} \xrightarrow[N \to \infty]{a.s.} \frac{(\sigma^2 + \lambda_k)(\sigma^2 c + \lambda_k)}{\lambda_k}, \tag{2}$$

and that if $\lambda_k \leq \sigma^2\sqrt{c}$, then

$$\hat\lambda_{k,N} \xrightarrow[N \to \infty]{a.s.} \sigma^2 (1 + \sqrt{c})^2. \tag{3}$$

This behaviour was first established in [7] using a different approach.
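The two regimes (2) and (3) can be explored numerically. The following sketch is only an illustration under stated assumptions: the function names `spike_limit` and `simulate_top_eigenvalue` are hypothetical, the matrix $B_N$ is taken of rank one with a single non zero entry, and the finite sizes used in a simulation only approximate the asymptotic regime.

```python
import numpy as np

def spike_limit(lam, sigma2, c):
    """Limit predicted by (2) if lam > sigma2*sqrt(c), and by (3) otherwise."""
    if lam > sigma2 * np.sqrt(c):
        return (sigma2 + lam) * (sigma2 * c + lam) / lam
    return sigma2 * (1.0 + np.sqrt(c)) ** 2

def simulate_top_eigenvalue(lam, sigma2, M, N, seed=0):
    """Largest eigenvalue of Sigma Sigma^* for an illustrative rank-one B_N.

    B_N has a single non zero entry sqrt(lam), so that B_N B_N^* has the
    eigenvalue lam with multiplicity 1 and the eigenvalue 0 otherwise.
    """
    rng = np.random.default_rng(seed)
    B = np.zeros((M, N), dtype=complex)
    B[0, 0] = np.sqrt(lam)
    # Complex Gaussian entries with zero mean and variance 1/N.
    W = (rng.standard_normal((M, N))
         + 1j * rng.standard_normal((M, N))) / np.sqrt(2 * N)
    Sigma = B + np.sqrt(sigma2) * W
    # eigvalsh returns the eigenvalues in ascending order.
    return np.linalg.eigvalsh(Sigma @ Sigma.conj().T)[-1]
```

For instance, with $\sigma^2 = 1$ and $c = 1/2$, `spike_limit(2.0, 1.0, 0.5)` evaluates the right-hand side of (2), while any `lam` below $\sqrt{0.5}$ returns the edge value of (3). Both expressions coincide at the threshold $\lambda_k = \sigma^2\sqrt{c}$.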

Motivations. Our work has been originally motivated by the context of array processing in which the signals transmitted by $K < M$ sources are received by an array equipped with $M$ sensors. Under certain assumptions, the $M$-dimensional vector $y(n)$ received on the sensor array at time $n$ can be written as

$$y(n) = \sum_{k=1}^{K} d_k s_k(n) + v(n), \tag{4}$$

where each time series $(s_k(n))_{n \in \mathbb{Z}}$ represents a non observable deterministic signal corresponding to source $k$ and where $d_k$ is an unknown deterministic $M$-dimensional vector depending on the direction of arrival of the $k$-th source. $(v(n))_{n \in \mathbb{Z}}$ is an additive complex white Gaussian noise such that $E[v(n)v(n)^*] = \sigma^2 I_M$. It is clear that (4) is equivalent to (1) if we put $\Sigma_N = N^{-1/2}[y(1), \ldots, y(N)]$, $W_N = N^{-1/2}\sigma^{-1}[v(1), \ldots, v(N)]$ and $B_N = N^{-1/2} D [s(1), \ldots, s(N)]$, with $s(n) = [s_1(n), \ldots, s_K(n)]^T$ and $D = [d_1, \ldots, d_K]$.
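The correspondence between the array model (4) and model (1) can be checked on synthetic data. In the sketch below, the function name `build_model`, the randomly drawn matrix `D` and the unit modulus source signals are assumptions made for illustration only; they are not prescribed by the paper.

```python
import numpy as np

def build_model(M, N, K, sigma, seed=0):
    """Build Sigma_N, B_N, W_N from N snapshots of the array model (4)."""
    rng = np.random.default_rng(seed)
    # Hypothetical matrix D = [d_1, ..., d_K] (drawn once, then held fixed).
    D = rng.standard_normal((M, K)) + 1j * rng.standard_normal((M, K))
    # Arbitrary unit modulus source signals s(1), ..., s(N), treated as deterministic.
    S = np.exp(2j * np.pi * rng.random((K, N)))
    # Complex white Gaussian noise with E[v(n) v(n)^*] = sigma^2 I_M.
    V = sigma * (rng.standard_normal((M, N))
                 + 1j * rng.standard_normal((M, N))) / np.sqrt(2)
    Y = D @ S + V                      # columns are y(1), ..., y(N)
    Sigma = Y / np.sqrt(N)
    B = D @ S / np.sqrt(N)
    W = V / (sigma * np.sqrt(N))       # entries of variance 1/N
    return Sigma, B, W
```

By construction `Sigma` equals `B + sigma * W`, and `B` has rank at most `K`.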

Model (4) poses important statistical problems such as detection of the number of sources $K$ or estimation of the directions of arrival of the $K$ sources. A number of estimation schemes based on the eigenvalues and the eigenvectors of matrix $\Sigma_N\Sigma_N^*$ were developed, and analysed if $N \to +\infty$ while $M$ remains fixed. If however $M$ and $N$ are of the same order of magnitude, the above techniques may fail, and it is therefore quite relevant to study these statistical problems in the asymptotic regime $M, N \to +\infty$ in such a way that $\frac{M}{N} \to c$, $c \in (0, +\infty)$. The number of sources may be constant or scale up with the dimensions $M$ and $N$. For this, the first step is to evaluate the behaviour of the eigenvalues of $\Sigma_N\Sigma_N^*$.

About the literature. Concerning the zero-mean correlated model. The problems addressed in this paper were studied extensively in the context of the popular zero-mean correlated model defined by

$$\Sigma_N = H_N W_N, \tag{5}$$

where $H_N$ is a deterministic $M \times M$ matrix and where $W_N$ is a random matrix with possibly non Gaussian zero mean variance $\frac{1}{N}$ i.i.d. entries. The most complete results concerning the almost sure localization of the eigenvalues of $\Sigma_N\Sigma_N^*$ are due to Bai-Silverstein [2], [3] and were established in the non Gaussian case. Spiked models were first proposed by Johnstone [20] in the context of (5) (matrix $H_N$ is a diagonal matrix defined as a finite rank perturbation of the identity matrix). Later, Baik et al. [4] studied, in the complex Gaussian case, the almost sure convergence of the largest eigenvalues of $\Sigma_N\Sigma_N^*$ and established central limit theorems. The analysis of [4] uses extensively the explicit form of the joint probability distribution of the entries of $\Sigma_N$. Using the results of [2], [3] as well as the characterization of the support of the limiting distribution $\mu_N$ of the empirical eigenvalue distribution $\hat\mu_N$ (see Silverstein-Choi [27]), Baik-Silverstein [5] addressed the non Gaussian case, and showed the almost sure convergence of certain eigenvalues of $\Sigma_N\Sigma_N^*$. Mestre considered in [21] the case where $H_N H_N^*$ has a finite number of different positive eigenvalues having multiplicities converging to $+\infty$, and showed how to estimate the eigenvalues of $H_N H_N^*$ as well as their associated eigenspace. Similar ideas were also developed in [22] in order to address the source localization problem in the context of large sensor arrays when the source signals are i.i.d. sequences. The analysis of Mestre [21], [22] is based on the results of [2], [3] as well as on the observation that it is possible to exhibit contours depending on the Stieltjes transform of $\mu_N$, and enclosing each eigenvalue of $H_N H_N^*$. Paul studied in [26] the behaviour of the eigenvectors associated to the greatest eigenvalues of a Gaussian spiked model (almost sure convergence and central limit theorems). Bai and Yao showed in [1] that certain eigenvalues of a non Gaussian spiked model satisfy a central limit theorem. We finally note that the above results on zero-mean spiked models have been used in the context of source localization (see [19], [23]).

Concerning the information plus noise model. Except our paper [28] devoted to the source localization of deterministic sources, the almost sure location of the eigenvalues of matrix $\Sigma_N\Sigma_N^*$ was not studied previously. In [28], we however followed partly the work of Capitaine et al. [9], devoted to finite rank deformed Gaussian (or satisfying a Poincaré inequality) Wigner matrices, which was inspired by previous results of Haagerup and Thorbjørnsen [17]. See also the recent paper [10] in which the rank of the deformation may scale with the size of the matrix. We used in [28] the same approach to prove that for $N$ large enough, no eigenvalue of $\Sigma_N\Sigma_N^*$ is outside the support of $\mu_N$. In [28], under the assumption that the eigenvalue $0$ of $B_N B_N^*$ is "far enough" from the others, we established a partial result showing that the $M - K$ smallest eigenvalues of $\Sigma_N\Sigma_N^*$ are almost surely separated from the others. In the present paper, we prove a general exact separation property extending the result of [5] to the complex Gaussian information plus noise model.

The almost sure behaviour (2), (3) of the largest eigenvalues of information plus noise spiked models appears to be a consequence of the general results of [7] devoted to the analysis of certain random models with additive and/or multiplicative finite rank perturbation. (2) and (3) are therefore not new, but the techniques of [7] completely differ from the approach used in the present paper, which can be seen as an extension to the information plus noise model of the paper [5].

Organization of the paper. In section 2, we review some results of [13] and [28] concerning the support $\mathcal{S}_N$ of $\mu_N$ as well as some useful background material. As [28] assumed $c_N < 1$, we address the case $c_N = 1$ and prove some extra results concerning the behaviour of the Stieltjes transform of $\mu_N$ around $0$. In section 3, we prove the analog of the exact separation of [3]. [9] generalized the approach of [3] to prove this property in the finite rank deformed Wigner model. We however show that it is still possible to use again the ideas of [17]. We establish that it is sufficient to prove that the mass (w.r.t. $\mu_N$) of any cluster of $\mathcal{S}_N$ is equal to the proportion of eigenvalues of $B_N B_N^*$ associated to this cluster. For this, we evaluate an integral along a certain contour enclosing the eigenvalues of $B_N B_N^*$ associated to the cluster. This contour is the analog of the contour introduced by [21] in the context of model (5) and was extensively used in [28]. Section 4 addresses the behaviour of the largest eigenvalues of an information plus noise spiked model. We analyse the support $\mathcal{S}_N$ of $\mu_N$, which appears equivalent to evaluating the positive extrema of a certain rational function. Using results concerning perturbed third order polynomial equations, it is shown that if $\lambda_k \neq \sigma^2\sqrt{c}$ for $k = 1, \ldots, K$, the intervals of $\mathcal{S}_N$ are

$$\left[\sigma^2(1 - \sqrt{c_N})^2 + O(1/M),\ \sigma^2(1 + \sqrt{c_N})^2 + O(1/M)\right]$$

and

$$\left[\frac{(\lambda_{k,N} + \sigma^2 c_N)(\lambda_{k,N} + \sigma^2)}{\lambda_{k,N}} - O_+(M^{-1/2}),\ \frac{(\lambda_{k,N} + \sigma^2 c_N)(\lambda_{k,N} + \sigma^2)}{\lambda_{k,N}} + O_+(M^{-1/2})\right],$$

where $k$ is any index for which $\lambda_{k,N} > \sigma^2\sqrt{c}$, and where $O_+(M^{-1/2})$ represents a positive $O(M^{-1/2})$ term. The results of section 3 imply immediately (2) and (3) when $\lambda_k \neq \sigma^2\sqrt{c}$ for $k = 1, \ldots, K$. If one of the $(\lambda_k)_{k=1,\ldots,K}$ is equal to $\sigma^2\sqrt{c}$, we use an argument similar to Baik-Silverstein [5], which relies on an eigenvalue perturbation technique.

Model and assumptions. We now summarize the model and assumptions which will be used in the paper, and introduce some definitions. Let $M, N, K \in \mathbb{N}^*$ such that $1 \leq K < M$, $K = K(N)$ and $M = M(N)$, functions of $N$ with $c_N = \frac{M}{N} \to c > 0$ as $N \to \infty$. We consider an $M \times N$ random matrix $\Sigma_N$ defined as

$$\Sigma_N = B_N + \sigma W_N,$$

where $\sigma > 0$ and $B_N$ and $W_N$ satisfy the two following assumptions.

Assumption A-1: Matrix $B_N$ is deterministic and satisfies $\sup_N \|B_N\| < +\infty$.

Assumption A-2: The entries of matrix $W_N$ are i.i.d. and follow a standard complex normal distribution $\mathcal{CN}(0, \frac{1}{N})$.

Note that the Gaussian assumption A-2 will be only required in section 3. All the results in section 2 concerning the convergence of the spectral distribution of $\Sigma_N\Sigma_N^*$ are also valid in the non Gaussian case. In the following, we study the context where

Assumption A-3: $B_N B_N^*$ is rank deficient, and the non zero eigenvalues of $B_N B_N^*$ have multiplicity 1.

The assumption on the multiplicities of the eigenvalues of $B_N B_N^*$ is not really necessary, but it allows to simplify the notations. We denote by $K$ the rank of $B_N B_N^*$ ($K$ may depend on $N$), and by $\lambda_{1,N} > \lambda_{2,N} > \ldots > \lambda_{K,N} > \lambda_{K+1,N} = \ldots = \lambda_{M,N} = 0$ its eigenvalues. We also assume that

Assumption A-4: $c_N = \frac{M}{N} \leq 1$ for each $N$.

This of course implies that $c \leq 1$. Assuming $c_N \leq 1$ does not introduce any restriction because if $c_N > 1$, the eigenvalues of $\Sigma_N\Sigma_N^*$ are $0$ with multiplicity $M - N$ as well as the eigenvalues of matrix $\Sigma_N^*\Sigma_N$. The location of this set of eigenvalues can of course be deduced from the results related to $c_N \leq 1$.

In this paper, $\mathcal{C}_c^\infty(\mathbb{R}, \mathbb{R})$ will denote the set of infinitely differentiable functions with compact support, defined from $\mathbb{R}$ to $\mathbb{R}$. If $\mathcal{A} \subset \mathbb{R}$, $\partial\mathcal{A}$ and $\mathrm{Int}(\mathcal{A})$ represent the boundary and the interior of $\mathcal{A}$ respectively.

We finally recall the definition and useful well known properties of the Stieltjes transform, a fundamental tool for the study of the eigenvalues of random matrices. Let $\mu$ be a positive finite measure on $\mathbb{R}$. We define its Stieltjes transform $\Psi_\mu$ as the function

$$\Psi_\mu(z) = \int_{\mathbb{R}} \frac{d\mu(\lambda)}{\lambda - z} \quad \forall z \in \mathbb{C} \setminus \mathrm{supp}(\mu),$$

where $\mathrm{supp}(\mu)$ represents the support of measure $\mu$. We have the following well-known properties.

Property 1. $\Psi_\mu$ satisfies

1. $\Psi_\mu$ is holomorphic on $\mathbb{C} \setminus \mathrm{supp}(\mu)$.

2. $z \in \mathbb{C}^+$ implies $\Psi_\mu(z) \in \mathbb{C}^+$.

3. If $\mu(\mathbb{R}^-) = 0$, then $z\Psi_\mu(z) \in \mathbb{C}^+$ if $z \in \mathbb{C}^+$.

2 Characterization of the support $\mathcal{S}_N$ of measure $\mu_N$

In this section, we recall some known results of [13] and [28] related to the support $\mathcal{S}_N$ of measure $\mu_N$. As we assumed in [28] that $c_N < 1$, we also provide, when it is necessary, some details on the specific case $c_N = 1$.

2.1 Convergence of the empirical spectral measure $\hat\mu_N$ of $\Sigma_N\Sigma_N^*$ toward $\mu_N$

We recall that $\hat\mu_N$ is defined by $\hat\mu_N = M^{-1}\sum_{k=1}^{M} \delta_{\hat\lambda_{k,N}}$. Its Stieltjes transform $\hat m_N$ is given, for all $z \in \mathbb{C} \setminus \{\hat\lambda_{1,N}, \ldots, \hat\lambda_{M,N}\}$, by

$$\hat m_N(z) = \int_{\mathbb{R}} \frac{d\hat\mu_N(\lambda)}{\lambda - z} = \frac{1}{M}\mathrm{Tr}\left(\Sigma_N\Sigma_N^* - z I_M\right)^{-1}.$$

The following result, concerning the convergence of $\hat m_N(z)$, can be found in [14, Th.1.1], [16, Th.7.4] (see also [18, Th.2.5] for a more general model).

Theorem 1. There exists a deterministic probability measure $\mu_N$ such that $\hat\mu_N - \mu_N \to 0$ as $N \to \infty$ with probability one. Equivalently, the Stieltjes transform $m_N$ of $\mu_N$ satisfies $\hat m_N(z) - m_N(z) \to 0$ almost surely, $\forall z \in \mathbb{C} \setminus \mathbb{R}^+$. Moreover, $\forall z \in \mathbb{C} \setminus \mathbb{R}^+$, $m_N(z)$ is the unique solution of the equation

$$m_N(z) = \frac{1}{M}\mathrm{Tr}\left[-z\left(1 + \sigma^2 c_N m_N(z)\right) I_M + \sigma^2 (1 - c_N) I_M + \frac{B_N B_N^*}{1 + \sigma^2 c_N m_N(z)}\right]^{-1} \tag{6}$$

satisfying $\mathrm{Im}(z m_N(z)) > 0$ for $z \in \mathbb{C}^+$.

The behaviour of the Stieltjes transform $m_N$ around the real axis is fundamental to evaluate the support of $\mu_N$. The following theorem is essentially due to [13].

Theorem 2.

1. If $c_N < 1$, the limit of $m_N(z)$, as $z \in \mathbb{C}^+$ converges to $x$, exists for each $x \in \mathbb{R}$ and is still denoted by $m_N(x)$. If $c_N = 1$, the limit exists for $x \neq 0$. The function $x \to m_N(x)$ is continuous on $\mathbb{R}$ if $c_N < 1$ and on $\mathbb{R}^*$ if $c_N = 1$. It is also continuously differentiable on $\mathbb{R} \setminus \partial\mathcal{S}_N$.

2. If $c_N < 1$, then $\mathrm{Re}(1 + \sigma^2 c_N m_N(z)) \geq 1/2$ for each $z \in \mathbb{C}^+ \cup \mathbb{R}$, and if $c_N = 1$, this inequality holds on $\mathbb{C}^+ \cup \mathbb{R}^*$.

3. $m_N(x)$ is a solution of (6) for $x \in \mathbb{R} \setminus \partial\mathcal{S}_N$.

4. Measure $\mu_N$ is absolutely continuous and its density is given by $f_{\mu_N}(x) = \pi^{-1}\mathrm{Im}(m_N(x))$.

The statements of this theorem are essentially contained in [13, Th.2.5] (see also [28] for more details), except item 2, because it is only shown in [13, Lem.2.1] that $\mathrm{Re}(1 + \sigma^2 c_N m_N(z)) > 0$. We therefore prove item 2 in Appendix A.

We note that as $m_N$ is a Stieltjes transform, it also satisfies $m_N(z^*) = m_N(z)^*$. Therefore, it holds that $m_N(z) \to m_N(x)^*$ as $z \in \mathbb{C}^- \to x$, for $x \in \mathbb{R}$ if $c_N < 1$ and for $x \in \mathbb{R}^*$ if $c_N = 1$.

In the following, we denote by $f_N$, $\phi_N$ and $w_N$ the functions defined by

$$
\begin{aligned}
f_N(w) &= \frac{1}{M}\mathrm{Tr}\left(B_N B_N^* - w I_M\right)^{-1}, \\
\phi_N(w) &= w\left(1 - \sigma^2 c_N f_N(w)\right)^2 + \sigma^2 (1 - c_N)\left(1 - \sigma^2 c_N f_N(w)\right), \\
w_N(z) &= z\left(1 + \sigma^2 c_N m_N(z)\right)^2 - \sigma^2 (1 - c_N)\left(1 + \sigma^2 c_N m_N(z)\right).
\end{aligned}
\tag{7}
$$
Functions $w_N$ and $\phi_N$ are of crucial importance because, as shown in [28], the interior of $\mathcal{S}_N$ is given by $\mathrm{Int}(\mathcal{S}_N) = \{x > 0, \mathrm{Im}(w_N(x)) > 0\}$ and for each $x \in \mathbb{R} \setminus \partial\mathcal{S}_N$, $w_N(x)$ is a solution of the equation $\phi_N(w) = x$. The characterization of $\mathcal{S}_N$ proposed in [28], based on a reformulation of the results in [13, Th.3.2, Th.3.3], consists in identifying $w_N(x)$ out of the set of solutions of $\phi_N(w) = x$.
We also note that (6) is equivalent to

$$\frac{m_N(z)}{1 + \sigma^2 c_N m_N(z)} = f_N(w_N(z)), \tag{8}$$

and that the identity

$$\frac{1}{1 + \sigma^2 c_N m_N(z)} = 1 - \sigma^2 c_N f_N(w_N(z)) \tag{9}$$

holds for $z \in \mathbb{C}^+ \cup \mathbb{R}$ if $c_N < 1$, or for $z \in \mathbb{C}^+ \cup \mathbb{R}^*$ if $c_N = 1$.
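The relations between (6), (7), (8) and (9) can be illustrated numerically. The sketch below is not the method of the paper: it assumes that a damped fixed point iteration of (6), started at $-1/z$, converges for the chosen $z \in \mathbb{C}^+$ (which is expected when $\mathrm{Im}(z)$ is large enough), and it uses only the eigenvalues `eig` of $B_N B_N^*$. The names `solve_m`, `f_N`, `w_N` and `phi_N` are hypothetical.

```python
import numpy as np

def f_N(w, eig):
    """f_N(w) = (1/M) Tr (B B^* - w I)^{-1}, computed from the eigenvalues of B B^*."""
    return np.mean(1.0 / (eig - w))

def solve_m(z, eig, sigma2, c, n_iter=2000, damping=0.5):
    """Damped fixed point iteration on equation (6); convergence is assumed, not proved."""
    m = -1.0 / z
    for _ in range(n_iter):
        u = 1.0 + sigma2 * c * m
        new = np.mean(1.0 / (-z * u + sigma2 * (1.0 - c) + eig / u))
        m = damping * m + (1.0 - damping) * new
    return m

def w_N(z, m, sigma2, c):
    """w_N(z) as defined in (7)."""
    u = 1.0 + sigma2 * c * m
    return z * u ** 2 - sigma2 * (1.0 - c) * u

def phi_N(w, eig, sigma2, c):
    """phi_N(w) as defined in (7)."""
    h = 1.0 - sigma2 * c * f_N(w, eig)
    return w * h ** 2 + sigma2 * (1.0 - c) * h
```

Once `m` solves (6), the quantity `m / (1 + sigma2 * c * m)` should agree with `f_N(w_N(z, m, sigma2, c), eig)` as in (8), and combining (7) with (9) gives `phi_N(w_N(z, ...), ...)` equal to `z`, which is the complex counterpart of the statement that $w_N(x)$ solves $\phi_N(w) = x$.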



2.2 Properties of $\phi_N$ and $w_N$, and characterization of $\mathcal{S}_N$

In this paragraph, we recall the main properties of functions $\phi_N$ and $w_N$, as well as the structure of $\mathcal{S}_N$. Lemmas 1, 2 as well as theorem 3 are proved in [28, Prop.3, Th.2] for $c_N < 1$, but the derivations for $c_N = 1$ are similar, except items 6 and 8 of lemma 2.

Lemma 1.

1. The function $\phi_N$ admits $2Q_N$ non-negative local extrema counting multiplicities (with $1 \leq Q_N \leq K + 1$) whose preimages are denoted $w_{1,N}^- < 0 < w_{1,N}^+ \leq w_{2,N}^- < \ldots \leq w_{Q_N,N}^- < w_{Q_N,N}^+$.

2. Define $x_{q,N}^- = \phi_N(w_{q,N}^-)$ and $x_{q,N}^+ = \phi_N(w_{q,N}^+)$ for $q = 1, \ldots, Q_N$. Then,

$$x_{1,N}^- < x_{1,N}^+ \leq x_{2,N}^- < \ldots \leq x_{Q_N,N}^- < x_{Q_N,N}^+,$$

and $x_{1,N}^- > 0$ if $c_N < 1$ while $x_{1,N}^- = 0$ if $c_N = 1$.

3. For $q = 1, \ldots, Q_N$, each interval $(w_{q,N}^-, w_{q,N}^+)$ contains at least one element of the set of eigenvalues $\{\lambda_{1,N}, \ldots, \lambda_{K,N}, 0\}$ and each eigenvalue of $B_N B_N^*$ belongs to one of these intervals.

4. $\phi_N$ is increasing on the intervals $(-\infty, w_{1,N}^-]$, $[w_{1,N}^+, w_{2,N}^-]$, $\ldots$, $[w_{Q_N-1,N}^+, w_{Q_N,N}^-]$, $[w_{Q_N,N}^+, +\infty)$, and moreover

$$
\begin{aligned}
\phi_N\left((-\infty, w_{1,N}^-]\right) &= (-\infty, x_{1,N}^-], \\
\phi_N\left([w_{q,N}^+, w_{q+1,N}^-]\right) &= [x_{q,N}^+, x_{q+1,N}^-] \quad \text{for each } q = 1, \ldots, Q_N - 1, \\
\phi_N\left([w_{Q_N,N}^+, +\infty)\right) &= [x_{Q_N,N}^+, +\infty).
\end{aligned}
$$

In figure 1, we give a typical representation of function $\phi_N$. We are now in position to recall the characterization of $\mathcal{S}_N$ presented in [28, Th.2].

Theorem 3. The support $\mathcal{S}_N$ is given by

$$\mathcal{S}_N = \bigcup_{q=1}^{Q_N} \left[x_{q,N}^-, x_{q,N}^+\right],$$

with $x_{1,N}^- = 0$ if $c_N = 1$ and $x_{1,N}^- > 0$ if $c_N < 1$.

The intervals $([x_{q,N}^-, x_{q,N}^+])_{q=1,\ldots,Q_N}$ will be called the clusters of $\mathcal{S}_N$. Cluster $[x_{q,N}^-, x_{q,N}^+]$ corresponds to the interval $[w_{q,N}^-, w_{q,N}^+]$ in the sense that $x_{q,N}^- = \phi_N(w_{q,N}^-)$ and $x_{q,N}^+ = \phi_N(w_{q,N}^+)$. Finally, we shall say that an eigenvalue $\lambda_{k,N}$ of $B_N B_N^*$ is associated to cluster $[x_{q,N}^-, x_{q,N}^+]$ if $\lambda_{k,N} \in (w_{q,N}^-, w_{q,N}^+)$.
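The characterization of the clusters through the local extrema of $\phi_N$ can be sketched numerically as follows. This is a simple grid scan followed by bisection on $\phi_N'$, given as an illustration only: the function names are hypothetical, the grid must be chosen fine enough and must not contain an eigenvalue of $B_N B_N^*$, and grid cells containing a pole of $\phi_N$ are skipped. The expression of $\phi_N'$ is obtained by differentiating the definition (7).

```python
import numpy as np

def f_N(w, eig):
    return np.mean(1.0 / (eig - w))

def df_N(w, eig):
    return np.mean(1.0 / (eig - w) ** 2)

def phi_N(w, eig, sigma2, c):
    h = 1.0 - sigma2 * c * f_N(w, eig)
    return w * h ** 2 + sigma2 * (1.0 - c) * h

def dphi_N(w, eig, sigma2, c):
    """Derivative of phi_N with respect to w."""
    h = 1.0 - sigma2 * c * f_N(w, eig)
    df = df_N(w, eig)
    return h ** 2 - 2.0 * sigma2 * c * w * df * h - sigma2 ** 2 * c * (1.0 - c) * df

def cluster_edges(eig, sigma2, c, grid):
    """Sorted non-negative values of phi_N at its local extrema found on the grid."""
    poles = np.unique(eig)
    edges = []
    for a, b in zip(grid[:-1], grid[1:]):
        if np.any((poles >= a) & (poles <= b)):
            continue  # phi_N' changes sign across a pole: not an extremum
        da, db = dphi_N(a, eig, sigma2, c), dphi_N(b, eig, sigma2, c)
        if da * db >= 0:
            continue
        for _ in range(60):  # bisection on the sign of phi_N'
            mid = 0.5 * (a + b)
            dm = dphi_N(mid, eig, sigma2, c)
            if da * dm <= 0:
                b = mid
            else:
                a, da = mid, dm
        x = phi_N(0.5 * (a + b), eig, sigma2, c)
        if x > -1e-12:
            edges.append(x)
    return sorted(edges)
```

When $B_N = 0$, $\phi_N(w) = (w + \sigma^2 c_N)(w + \sigma^2)/w$, its extrema are located at $w = \pm\sigma^2\sqrt{c_N}$, and the scan should return the two edges $\sigma^2(1 \mp \sqrt{c_N})^2$ of the Marchenko-Pastur support.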

In the same way as in theorem 2, we set $w_N(x) = \lim_{z \in \mathbb{C}^+, z \to x} w_N(z)$ for $x \in \mathbb{R}$ if $c_N < 1$ and for $x \in \mathbb{R}^*$ if $c_N = 1$. We notice that $\lim_{z \in \mathbb{C}^-, z \to x} w_N(z) = w_N(x)^*$. Function $x \to w_N(x)$ satisfies the following properties.

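Figure 1: Function $\phi_N$ for $K = 4$ and $c < 1$. Here $Q = 3$.

Lemma 2. The following properties hold.

1. $x \to w_N(x)$ is continuous on $\mathbb{R}$ if $c_N < 1$ and on $\mathbb{R}^*$ if $c_N = 1$, and is continuously differentiable on $\mathbb{R} \setminus \partial\mathcal{S}_N$.

2. $w_N$ is real and increasing on $\mathbb{R} \setminus \mathcal{S}_N$.

3. $1 - \sigma^2 c_N f_N(w_N(x)) \neq 0$ for $x \in \mathbb{R} \setminus \partial\mathcal{S}_N$.

4. $x \in \mathrm{Int}(\mathcal{S}_N)$ if and only if $w_N(x) \in \mathbb{C}^+$.

5. For $x \in \mathbb{R} \setminus \partial\mathcal{S}_N$, $w_N(x)$ is a solution of the equation $\phi_N(w) = x$. If $x \in \mathrm{Int}(\mathcal{S}_N)$, $w_N(x)$ is the unique solution belonging to $\mathbb{C}^+$ and if $x \in \mathbb{R} \setminus \mathcal{S}_N$, $w_N(x)$ is the unique solution satisfying $\phi_N'(w_N(x)) > 0$ and $1 - \sigma^2 c_N f_N(w_N(x)) > 0$.

6. Function $x \to w_N(x)$ is continuous at $x = x_{1,N}^- = 0$ for $c_N = 1$.

7. For $q = 1, \ldots, Q_N$, $w_N(x_{q,N}^-) = w_{q,N}^-$ and $w_N(x_{q,N}^+) = w_{q,N}^+$.

8. Let $q = 1, \ldots, Q_N$. Then, there exist a constant $C > 0$ and neighborhoods $\mathcal{V}(x_{q,N}^-)$, $\mathcal{V}(x_{q,N}^+)$ of respectively $x_{q,N}^-$ and $x_{q,N}^+$ such that

$$|w_N'(x)| \leq C\,|x - x_{q,N}^-|^{-1/2} \quad \forall x \in \mathcal{V}(x_{q,N}^-) \cap \mathbb{R} \setminus \{x_{q,N}^-\}, \tag{10}$$

$$|w_N'(x)| \leq C\,|x - x_{q,N}^+|^{-1/2} \quad \forall x \in \mathcal{V}(x_{q,N}^+) \cap \mathbb{R} \setminus \{x_{q,N}^+\}. \tag{11}$$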


The lemma was proved in [28, Prop.2, Lem.3] in the case $c_N < 1$. The proofs extend easily to $c_N = 1$, except items 6 and 8 for $q = 1$. These two statements are proved in Appendix B.

We finish this section by showing that the following result holds.

Corollary 1. We have

$$\sup_N x_{Q_N,N}^+ < \infty,$$

i.e. $\bigcup_N \mathcal{S}_N$ is a bounded set.

Proof: We define $\lambda_{\max}$ by $\lambda_{\max} = \sup_N \|B_N\|^2$. It follows that for $w > \lambda_{\max}$,

$$\sup_N |f_N(w)| \leq \frac{1}{|\lambda_{\max} - w|}, \qquad \sup_N |f_N'(w)| \leq \frac{1}{|\lambda_{\max} - w|^2}, \qquad \sup_N |w f_N'(w)| \leq \frac{w}{|\lambda_{\max} - w|^2},$$

and since

$$\phi_N'(w) = \left(1 - \sigma^2 c_N f_N(w)\right)^2 - 2\sigma^2 c_N w f_N'(w)\left(1 - \sigma^2 c_N f_N(w)\right) - \sigma^4 c_N (1 - c_N) f_N'(w)$$

converges towards 1 when $w \to +\infty$, uniformly in $N$ by the above bounds, we deduce that for each $\epsilon \in (0, 1)$, $\exists w_\epsilon > \lambda_{\max}$ such that $\forall w > w_\epsilon$, $\phi_N'(w) > \epsilon$ for all $N$. Since $\phi_N'(w_{Q_N,N}^+) = 0$, this implies that

$$\sup_N w_{Q_N,N}^+ \leq w_\epsilon < +\infty.$$

Moreover, using

$$w_{Q_N,N}^+ = w_N(x_{Q_N,N}^+) = x_{Q_N,N}^+\left(1 + \sigma^2 c_N m_N(x_{Q_N,N}^+)\right)^2 - \sigma^2 (1 - c_N)\left(1 + \sigma^2 c_N m_N(x_{Q_N,N}^+)\right),$$

and item 2 of theorem 2, we get that

$$x_{Q_N,N}^+ \leq \frac{w_\epsilon}{\left(1 + \sigma^2 c_N m_N(x_{Q_N,N}^+)\right)^2} + \frac{\sigma^2 (1 - c_N)}{1 + \sigma^2 c_N m_N(x_{Q_N,N}^+)} \leq 4 w_\epsilon + 2\sigma^2.$$

This completes the proof. □



3 Almost sure location of the sample eigenvalues

We first recall the following result of [28, Th.3], which states the almost sure absence of eigenvalues of $\Sigma_N\Sigma_N^*$ outside the support $\mathcal{S}_N$ of $\mu_N$ for all large $N$. This property is well-known in the context of zero mean non Gaussian correlated matrices (see [2]). We note that the proof of [28, Th.3] uses extensively that $W_N$ is Gaussian (assumption A-2).

Theorem 4. Let $a, b \in \mathbb{R}$, $\epsilon > 0$ and $N_0 \in \mathbb{N}$ such that $(a - \epsilon, b + \epsilon) \cap \mathcal{S}_N = \emptyset$ for each $N > N_0$. Then, with probability one, no eigenvalue of $\Sigma_N\Sigma_N^*$ belongs to $[a, b]$ for $N$ large enough.

We remark that theorem 4 extends to semi-infinite intervals $[b, +\infty)$ because, as $\|W_N W_N^*\| \to (1 + \sqrt{c})^2$ almost surely, it holds that $\hat\lambda_{1,N} = \|\Sigma_N\Sigma_N^*\| \leq \sup_N \|B_N B_N^*\| + 2\sigma^2 (1 + \sqrt{c})^2$ almost surely for $N$ large enough.

In order to interpret this result, assume that for each $N > N_1 \geq N_0$, the number of clusters of $\mathcal{S}_N$ does not depend on $N$ (denote by $Q$ the number of clusters), and that for each $q = 1, \ldots, Q$, the sequences $(x_{q,N}^-)_{N > N_1}$ and $(x_{q,N}^+)_{N > N_1}$ converge towards limits $x_q^-$ and $x_q^+$ satisfying $x_1^- < x_1^+ < x_2^- < x_2^+ < \ldots < x_Q^- < x_Q^+$. In this context, theorem 4 implies that almost surely, for each $\epsilon > 0$, each eigenvalue belongs to one of the intervals $[x_q^- - \epsilon, x_q^+ + \epsilon]$ for $N$ large enough.

We now establish the following property, also well-known in the literature and referred to as "exact separation" (see e.g. [3] in the context of non Gaussian correlated zero mean random matrices).

Theorem 5. Let $a, b \in \mathbb{R}$, $\epsilon > 0$, $N_0 \in \mathbb{N}$ such that $(a - \epsilon, b + \epsilon) \cap \mathcal{S}_N = \emptyset$ for $N > N_0$. Then, with probability one,

$$
\begin{aligned}
\mathrm{card}\{k : \hat\lambda_{k,N} < a\} &= \mathrm{card}\{k : \lambda_{k,N} < w_N(a)\}, \\
\mathrm{card}\{k : \hat\lambda_{k,N} > b\} &= \mathrm{card}\{k : \lambda_{k,N} > w_N(b)\}
\end{aligned}
\tag{12}
$$

for $N$ large enough.

Under the above simplified assumptions, this result means that almost surely for $N$ large enough, the number of sample eigenvalues that belong to each interval $[x_q^- - \epsilon, x_q^+ + \epsilon]$ coincides with the number of eigenvalues of $B_N B_N^*$ that are associated to the cluster $[x_{q,N}^-, x_{q,N}^+]$. To prove theorem 5, we use the same technique as in [28], where a less general result is presented in the case $c < 1$.

3.1 Preliminary results

We first need to state preliminary useful lemmas. The first lemma is elementary and is related to the solutions of the equation $1 - \sigma^2 c_N f_N(w) = 0$.

Lemma 3. The equation $1 - \sigma^2 c_N f_N(w) = 0$ admits $K + 1$ real solutions $z_{0,N} < 0 < z_{1,N} < \lambda_{1,N} < \ldots < z_{K,N} < \lambda_{K,N}$. If $c_N < 1$, $z_{0,N} > w_{1,N}^-$ while if $c_N = 1$, $z_{0,N} = w_{1,N}^-$. Moreover, for each $k = 1, \ldots, K$, each solution $z_{k,N}$ belongs to the interval $(w_{q,N}^-, w_{q,N}^+)$ containing eigenvalue $\lambda_{k,N}$, with $q \in \{1, \ldots, Q_N\}$.

The next two lemmas are fundamental, and were proved by Haagerup-Thorbjørnsen in [17] in the Wigner case (see also [8]). Lemmas 4 and 5 are established in [28, Prop.4, Lem.2 and proof of Th.3]. Note that, unlike in section 2, the Gaussian assumption is required here. We give here some insights on the proof of these two lemmas for the reader's convenience.

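Lemma 4. Let $\psi \in \mathcal{C}_c^\infty(\mathbb{R}, \mathbb{R})$, independent of $N$. Then

$$E\left[\frac{1}{M}\mathrm{Tr}\,\psi(\Sigma_N\Sigma_N^*)\right] = \int_{\mathbb{R}} \psi(\lambda)\,d\mu_N(\lambda) + O\left(\frac{1}{N^2}\right).$$

Proof: Using the integration by parts formula (see e.g. [24], [25]) and the Poincaré inequality for Gaussian random vectors [12], it is proved in [28, Prop.4] that

$$E[\hat m_N(z)] = m_N(z) + \frac{\chi_N(z)}{N^2}, \tag{13}$$

where $\chi_N$ is holomorphic on $\mathbb{C} \setminus \mathbb{R}$ and satisfies

$$|\chi_N(z)| \leq P_1(|z|)\,P_2(|\mathrm{Im}(z)|^{-1}), \tag{14}$$

with $P_1$, $P_2$ two polynomials with positive coefficients independent of $N$, $z$. The Stieltjes inversion formula gives

$$E\left[\frac{1}{M}\mathrm{Tr}\,\psi(\Sigma_N\Sigma_N^*)\right] = \frac{1}{\pi}\lim_{y \downarrow 0} \mathrm{Im}\left(\int_{\mathbb{R}} \psi(x)\,E[\hat m_N(x + iy)]\,dx\right), \tag{15}$$

as well as $\int_{\mathbb{R}} \psi(\lambda)\,d\mu_N(\lambda) = \pi^{-1}\lim_{y \downarrow 0} \mathrm{Im}\left(\int_{\mathbb{R}} \psi(x)\,m_N(x + iy)\,dx\right)$. The polynomial bound (14) implies the bound $\limsup_{y \downarrow 0} \left|\int_{\mathbb{R}} \psi(x)\,\chi_N(x + iy)\,dx\right| \leq C < \infty$, with $C > 0$ independent of $N$ (a result shown in [8, Sec.3.3] using the ideas of [17]). Plugging (13) into (15), we obtain the desired result. □

Lemma 5 is not explicitly stated in [28], but it can be proved easily using the derivation of [28, eq. (37)].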



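Lemma 5. Let $\psi \in \mathcal{C}_c^\infty(\mathbb{R}, \mathbb{R})$, independent of $N$ and constant on each cluster of $\mathcal{S}_N$ for $N$ large enough. Then, we have

$$\mathrm{Var}\left[\frac{1}{M}\mathrm{Tr}\,\psi(\Sigma_N\Sigma_N^*)\right] = O\left(\frac{1}{N^4}\right).$$

Proof: We only give a sketch of proof for the reader's convenience. Using the Poincaré inequality for Gaussian random vectors, we obtain, for some constant $C$ independent of $N$,

$$\mathrm{Var}\left[\frac{1}{M}\mathrm{Tr}\,\psi(\Sigma_N\Sigma_N^*)\right] \leq \frac{C}{N^2}\,E\left[\frac{1}{M}\mathrm{Tr}\left(\psi'(\Sigma_N\Sigma_N^*)^2\,\Sigma_N\Sigma_N^*\right)\right] = \frac{C}{N^2}\left(\int_{\mathbb{R}} \lambda\,\psi'(\lambda)^2\,d\mu_N(\lambda) + O\left(\frac{1}{N^2}\right)\right),$$

where the last equality follows from the application of lemma 4 to the function $\lambda \mapsto \lambda\psi'(\lambda)^2$. The conclusion follows from the observation that this function vanishes on $\mathcal{S}_N$ for all large $N$. □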

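We are now in position to prove theorem 5.

3.2 End of the proof

We first prove the first equality of (12) and assume that $a > 0$ because it is obvious if $a \leq 0$. We consider $\eta < \epsilon$ and assume without restriction that $0 < \eta < a$. We consider a function $\psi_a \in \mathcal{C}_c^\infty(\mathbb{R}, \mathbb{R})$, independent of $N$, such that $\psi_a \in [0, 1]$ and

$$\psi_a(\lambda) = \begin{cases} 1 & \forall \lambda \in [0, a - \eta], \\ 0 & \forall \lambda \geq a. \end{cases}$$

By lemma 4, we have

$$E\left[\frac{1}{M}\mathrm{Tr}\,\psi_a(\Sigma_N\Sigma_N^*)\right] = \int_{\mathbb{R}} \psi_a(\lambda)\,d\mu_N(\lambda) + O\left(\frac{1}{N^2}\right),$$

or equivalently, since $\mu_N$ does not charge $(a - \epsilon, b + \epsilon)$,

$$E\left[\frac{1}{M}\mathrm{Tr}\,\psi_a(\Sigma_N\Sigma_N^*)\right] = \mu_N([0, a - \eta]) + O\left(\frac{1}{N^2}\right).$$

Lemma 5 also implies that

$$\mathrm{Var}\left[\frac{1}{M}\mathrm{Tr}\,\psi_a(\Sigma_N\Sigma_N^*)\right] = O\left(\frac{1}{N^4}\right).$$

Therefore, Markov inequality leads to

$$
\begin{aligned}
P\left(\left|\frac{1}{M}\mathrm{Tr}\,\psi_a(\Sigma_N\Sigma_N^*) - \mu_N([0, a - \eta])\right| > \frac{1}{N^{4/3}}\right)
&\leq N^{8/3}\,\mathrm{Var}\left[\frac{1}{M}\mathrm{Tr}\,\psi_a(\Sigma_N\Sigma_N^*)\right] \\
&\quad + N^{8/3}\left|E\left[\frac{1}{M}\mathrm{Tr}\,\psi_a(\Sigma_N\Sigma_N^*)\right] - \mu_N([0, a - \eta])\right|^2 \\
&= O\left(\frac{1}{N^{4/3}}\right),
\end{aligned}
$$

which implies, by the Borel-Cantelli lemma, that with probability one

$$\frac{1}{M}\mathrm{Tr}\,\psi_a(\Sigma_N\Sigma_N^*) = \mu_N([0, a - \eta]) + O\left(\frac{1}{N^{4/3}}\right). \tag{16}$$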



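The remainder of the proof is dedicated to the evaluation of $\mu_N([0, a - \eta])$. Let $I_N = \max\{q : x_{q,N}^+ < a\}$. It is clear that $\mu_N([0, a - \eta]) = \sum_{q=1}^{I_N} \mu_N([x_{q,N}^-, x_{q,N}^+])$ because $\mu_N((a - \epsilon, b + \epsilon)) = 0$. By theorem 2, $\mu_N$ is absolutely continuous with density $\pi^{-1}\mathrm{Im}(m_N(x))$. Therefore, it holds that

$$\mu_N([x_{q,N}^-, x_{q,N}^+]) = \frac{1}{\pi}\mathrm{Im}\left(\int_{x_{q,N}^-}^{x_{q,N}^+} m_N(x)\,dx\right). \tag{17}$$

In order to evaluate the right-hand side of (17), we use the contour integral approach of [28], which is the analog of the one introduced in [21] for model (5). For this, we consider the curve $\mathcal{C}_{q,N}$ defined by

$$\mathcal{C}_{q,N} = \left\{w_N(x) : x \in [x_{q,N}^-, x_{q,N}^+]\right\} \cup \left\{w_N(x)^* : x \in [x_{q,N}^-, x_{q,N}^+]\right\}.$$

We notice that $x \to w_N(x)$ (resp. $x \to w_N(x)^*$) is a one-to-one correspondence from $(x_{q,N}^-, x_{q,N}^+)$ onto $\{w_N(x), x \in (x_{q,N}^-, x_{q,N}^+)\}$ (resp. $\{w_N(x)^*, x \in (x_{q,N}^-, x_{q,N}^+)\}$) because if $w_N(x) = w_N(y)$, then $\phi_N(w_N(x)) = x = \phi_N(w_N(y)) = y$ (see lemma 2, item 5).

It follows from lemma 2, items 1, 4 and 7 that $\mathcal{C}_{q,N}$ is a closed continuous contour enclosing the interval $(w_{q,N}^-, w_{q,N}^+)$. $\mathcal{C}_{q,N}$ is differentiable at each point except at $w_{q,N}^-$ and $w_{q,N}^+$ (see item 8 of lemma 2). However, (10) and (11) imply that $|w_N'|$ is summable on $[x_{q,N}^-, x_{q,N}^+]$. Therefore, for each function $g$ continuous in a neighborhood of $\mathcal{C}_{q,N}$, satisfying $(g(w))^* = g(w^*)$, it is still possible to define the contour integral $\oint_{\mathcal{C}_{q,N}^-} g(w)\,dw$ by

$$\oint_{\mathcal{C}_{q,N}^-} g(w)\,dw = 2i\,\mathrm{Im}\left(\int_{x_{q,N}^-}^{x_{q,N}^+} g(w_N(x))\,w_N'(x)\,dx\right).$$

The notation $\mathcal{C}_{q,N}^-$ means that the contour $\mathcal{C}_{q,N}$ is clockwise oriented. Although $\mathcal{C}_{q,N}$ is not differentiable, the main results related to contour integrals of meromorphic functions remain valid. In particular, it holds that

$$\mathrm{Ind}_{\mathcal{C}_{q,N}}(\xi) = \frac{1}{2\pi i}\oint_{\mathcal{C}_{q,N}^-} \frac{d\lambda}{\xi - \lambda} = \begin{cases} 1 & \text{if } \xi \in (w_{q,N}^-, w_{q,N}^+), \\ 0 & \text{if } \xi \notin [w_{q,N}^-, w_{q,N}^+]. \end{cases}$$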
In order to evaluate the right-hand side of (17) using a contour integral, we remark that

$$m_N(x) = \frac{f_N(w_N(x))}{1 - \sigma^2 c_N f_N(w_N(x))} \quad \forall x \in \mathbb{R} \setminus \partial\mathcal{S}_N$$

(see (8), (9) and item 3 of lemma 2). Moreover, by item 5 of lemma 2, we have $w_N'(x)\,\phi_N'(w_N(x)) = 1$ on $(x_{q,N}^-, x_{q,N}^+)$. Therefore, we have

$$\mu_N([x_{q,N}^-, x_{q,N}^+]) = \frac{1}{\pi}\mathrm{Im}\left(\int_{x_{q,N}^-}^{x_{q,N}^+} g_N(w_N(x))\,w_N'(x)\,dx\right), \tag{18}$$

where $g_N(w)$ is the rational function defined by

$$g_N(w) = \frac{f_N(w)\,\phi_N'(w)}{1 - \sigma^2 c_N f_N(w)} = f_N(w)\,\frac{\left(1 - \sigma^2 c_N f_N(w)\right)^2 - 2\sigma^2 c_N w f_N'(w)\left(1 - \sigma^2 c_N f_N(w)\right) - \sigma^4 c_N (1 - c_N) f_N'(w)}{1 - \sigma^2 c_N f_N(w)}.$$

In order to justify the existence of the integral on the right-hand side of (18), we prove that $g_N(w)$ is continuous in a neighborhood of $\mathcal{C}_{q,N}$. We first note that the poles of $g_N(w)$ coincide with the eigenvalues of $B_N B_N^*$ and the zeros $(z_{k,N})_{k=0,\ldots,K}$ of $1 - \sigma^2 c_N f_N(w)$. As $w_N(x)$ is not real on $(x_{q,N}^-, x_{q,N}^+)$, $x \to g_N(w_N(x))$ is continuous on $(x_{q,N}^-, x_{q,N}^+)$. It remains to check the continuity at $x_{q,N}^-$ and $x_{q,N}^+$. If $c_N < 1$, $w_{q,N}^- = w_N(x_{q,N}^-)$ and $w_{q,N}^+ = w_N(x_{q,N}^+)$ do not coincide with one of the poles of $g_N(w)$. If $c_N = 1$ and $q = 1$, this property still holds true except for $w_{1,N}^- = w_N(x_{1,N}^-) = w_N(0)$ because $z_{0,N} = w_{1,N}^-$ (see lemma 3). However, if $c_N = 1$, the solutions of $1 - \sigma^2 c_N f_N(w) = 0$ are not poles of $g_N$ due to a pole-zero cancellation.

Therefore, it is clear that $\mu_N([x_{q,N}^-, x_{q,N}^+])$ can also be written as

$$\mu_N([x_{q,N}^-, x_{q,N}^+]) = \frac{1}{2\pi i}\oint_{\mathcal{C}_{q,N}^-} g_N(\lambda)\,d\lambda.$$

The integral can be evaluated using the residue theorem and we give here the main steps of the calculation. Define $\mathcal{K}_q = \{k \in \{1, 2, \ldots, K\} : \lambda_{k,N} \in (w_{q,N}^-, w_{q,N}^+)\}$ and $L_q = \mathrm{card}(\mathcal{K}_q)$ ($L_q \geq 1$ for $q \geq 2$ from lemma 1, item 3). Assume $c_N < 1$. Since $\mathcal{C}_{q,N}$ only encloses $(w_{q,N}^-, w_{q,N}^+)$, we will have residues at the following points:

• for $q = 1$: residues at $z_{0,N}$, $0$, and $z_{k,N}$, $\lambda_{k,N}$ for $k \in \mathcal{K}_1$.

• for $q \geq 2$: residues at $z_{k,N}$, $\lambda_{k,N}$ for $k \in \mathcal{K}_q$.

If $c_N = 1$, the zeros of $1 - \sigma^2 c_N f_N(w)$ are not poles of $g_N(w)$:

• for $q = 1$: residues at $0$ and $\lambda_{k,N}$ for $k \in \mathcal{K}_1$.

• for $q \geq 2$: residues at $\lambda_{k,N}$ for $k \in \mathcal{K}_q$.

We just consider the case $c_N < 1$ in the following (the calculations are similar for $c_N = 1$ and are therefore omitted). We consider the decomposition $g_N(\lambda) = g_{1,N}(\lambda) + g_{2,N}(\lambda) + g_{3,N}(\lambda)$, with

$$
\begin{aligned}
g_{1,N}(\lambda) &= f_N(\lambda)\left(1 - \sigma^2 c_N f_N(\lambda)\right), \\
g_{2,N}(\lambda) &= -2\sigma^2 c_N \lambda f_N(\lambda) f_N'(\lambda), \\
g_{3,N}(\lambda) &= -\sigma^4 c_N (1 - c_N)\,\frac{f_N(\lambda) f_N'(\lambda)}{1 - \sigma^2 c_N f_N(\lambda)}.
\end{aligned}
$$

These three functions admit poles at $0$, $(\lambda_{k,N})_{k=1,\ldots,K}$, and $g_{3,N}$ has moreover poles at $(z_{k,N})_{k=0,\ldots,K}$. After tedious but straightforward calculations, we finally find that for $k \in \{1, 2, \ldots, K\}$,

$$
\begin{aligned}
\mathrm{Res}(g_{1,N}, \lambda_{k,N}) &= -\frac{1}{M} + \frac{2\sigma^2 c_N}{M}\,\frac{1}{M}\sum_{l \neq k} \frac{1}{\lambda_{l,N} - \lambda_{k,N}}, \\
\mathrm{Res}(g_{2,N}, \lambda_{k,N}) &= -\frac{2\sigma^2 c_N}{M}\,\frac{1}{M}\sum_{l \neq k} \frac{1}{\lambda_{l,N} - \lambda_{k,N}}, \\
\mathrm{Res}(g_{3,N}, \lambda_{k,N}) &= -\frac{1 - c_N}{c_N},
\end{aligned}
$$

where the sums run over all indices $l \in \{1, \ldots, M\}$ different from $k$.
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For the residues at 0, we get 



M-K ? M-K 1 * 1 
BeB(g^,0)=-— + 2a Cw __£_, 



Res(g 2 , JV( 0) = -2a 2 c N ^^- £ — 
Res(g 3 ,iv,0) = 



M M ,tl h,N 



Finally, the residues at $z_{k,N}$ for $k = 0, \ldots, K$ are given by $\mathrm{Res}(g_{3,N}, z_{k,N}) = \frac{1-c_N}{c_N}$. Using these evaluations, we obtain immediately that if $q \geq 2$, then

$$\mu_N([x_{q,N}^-, x_{q,N}^+]) = -\sum_{k \in \mathcal{J}_q}\left[\mathrm{Res}(g_{1,N}, \lambda_{k,N}) + \mathrm{Res}(g_{2,N}, \lambda_{k,N}) + \mathrm{Res}(g_{3,N}, \lambda_{k,N}) + \mathrm{Res}(g_{3,N}, z_{k,N})\right] = \frac{L_q}{M}.$$

This coincides with the proportion of eigenvalues of $\mathbf{B}_N\mathbf{B}_N^*$ associated to the cluster $[x_{q,N}^-, x_{q,N}^+]$ (i.e. the eigenvalues $\lambda_{k,N}$ for $k \in \mathcal{J}_q$). If $q = 1$,

$$\mu_N([x_{1,N}^-, x_{1,N}^+]) = -\sum_{k \in \mathcal{J}_1}\left[\mathrm{Res}(g_{1,N}, \lambda_{k,N}) + \mathrm{Res}(g_{2,N}, \lambda_{k,N}) + \mathrm{Res}(g_{3,N}, \lambda_{k,N}) + \mathrm{Res}(g_{3,N}, z_{k,N})\right]$$

$$\qquad - \left[\mathrm{Res}(g_{1,N}, 0) + \mathrm{Res}(g_{2,N}, 0) + \mathrm{Res}(g_{3,N}, 0) + \mathrm{Res}(g_{3,N}, z_{0,N})\right] = \frac{L_1}{M} + \frac{M-K}{M},$$

which also coincides with the proportion of eigenvalues of $\mathbf{B}_N\mathbf{B}_N^*$ associated to the cluster $[x_{1,N}^-, x_{1,N}^+]$ (the $\lambda_{k,N}$ for $k \in \mathcal{J}_1$ and $0$ with multiplicity $M-K$).
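The value of the residue of $g_{3,N}$ at a simple zero of $1 - \sigma^2 c_N f_N$ can be checked numerically. The sketch below is only an illustration: it assumes that $f_N(w) = \frac{1}{M}\sum_{k}(\lambda_{k,N} - w)^{-1}$, the sum running over all $M$ eigenvalues of $\mathbf{B}_N\mathbf{B}_N^*$ (null ones included), and the eigenvalues, $\sigma^2$, $c$ and helper names are hypothetical choices. It locates a zero by bisection and estimates the residue by a symmetric difference, to be compared with $(1-c)/c$.

```python
def f(w, eigs):
    """Assumed form f_N(w) = (1/M) * sum_k 1 / (lambda_k - w)."""
    return sum(1.0 / (lam - w) for lam in eigs) / len(eigs)

def f_prime(w, eigs):
    """Derivative of the assumed f_N."""
    return sum(1.0 / (lam - w) ** 2 for lam in eigs) / len(eigs)

def g3(w, eigs, sigma2, c):
    """g_3 = -sigma^4 c (1 - c) f f' / (1 - sigma^2 c f)."""
    num = -sigma2 ** 2 * c * (1.0 - c) * f(w, eigs) * f_prime(w, eigs)
    return num / (1.0 - sigma2 * c * f(w, eigs))

def zero_between(eigs, sigma2, c, lo, hi, iters=200):
    """Bisection for 1 - sigma2*c*f(w) = 0 on (lo, hi); a sign change is assumed."""
    F = lambda w: 1.0 - sigma2 * c * f(w, eigs)
    sign_lo = F(lo) > 0
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if (F(mid) > 0) == sign_lo:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Illustrative data: M = 4, K = 2 (two null eigenvalues), sigma^2 = 1, c = 1/2.
eigs = [0.0, 0.0, 1.0, 3.0]
sigma2, c = 1.0, 0.5
z = zero_between(eigs, sigma2, c, 1e-9, 1.0 - 1e-9)

# Near a simple pole z, g3(z + h) ~ res / h + regular part, so the symmetric
# difference below isolates the residue up to O(h^2).
h = 1e-6
residue = 0.5 * h * (g3(z + h, eigs, sigma2, c) - g3(z - h, eigs, sigma2, c))
print(z, residue, (1.0 - c) / c)
```

The identity only uses $f_N(z) = 1/(\sigma^2 c_N)$ and $f_N'(z) \neq 0$ at the zero, so other choices of eigenvalues should give the same residue.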
Therefore, using {16), we get that 



(In \ 

K q=l 



N 



1/3 



(19) 



But almost surely, for $N$ large enough, $\mathrm{Tr}\,\psi_N(\boldsymbol{\Sigma}_N\boldsymbol{\Sigma}_N^*)$ is exactly the number of eigenvalues contained in $[0, a]$ because no eigenvalue of $\boldsymbol{\Sigma}_N\boldsymbol{\Sigma}_N^*$ belongs to $[a - \eta, a]$ (use theorem 4 with $a - \eta$ in place of $a$). The left hand side of (19) is thus an integer. Since this integer decreases at rate $N^{-1/3}$, it is equal to zero
for N large enough. Q~2} follows from the observation that Lq + M- K is equal to the number of 
eigenvalues of B^B^ that are less than w^ia). 

To evaluate the number of eigenvalues in interval $(b, +\infty)$, we use that no eigenvalue belongs to $[a, b]$ (theorem 4). Therefore,

$$\mathrm{card}\{k : \hat\lambda_{k,N} > b\} = M - \mathrm{card}\{k : \hat\lambda_{k,N} < a\}.$$

(12) implies that 

In 

cardjfc : X k , N > b} = M- £ L q - (M- K), 

which coincides with the number of eigenvalues of $\mathbf{B}_N\mathbf{B}_N^*$ in interval $(w_N(b), +\infty)$. This concludes the proof of theorem 5.



4 Applications to the spiked models

In this section, we use the above results in order to evaluate the behaviour of the largest eigenvalues of the information plus noise spiked models. In the remainder of this section, we assume that

Assumption A-5: $K$ does not depend on $N$ and for all $k = 1, \ldots, K$, the positive sequence $(\lambda_{k,N})$ writes

$$\lambda_{k,N} = \lambda_k + \epsilon_{k,N},$$

with $\lim_{N \to +\infty} \epsilon_{k,N} = 0$ and $\lambda_i \neq \lambda_j$ for $i \neq j$.

We define $K_s = \max\{k : \lambda_k > \sigma^2\sqrt{c}\}$ and the function $\psi(\lambda) = \frac{(\lambda + \sigma^2 c)(\lambda + \sigma^2)}{\lambda}$. In the following, we characterize the support of measure $\mu_N$ and use the above results on the almost sure location of the sample eigenvalues in order to prove the following theorem.

Theorem 6. We have with probability one,

$$\hat\lambda_{k,N} \xrightarrow[N \to \infty]{} \begin{cases} \psi(\lambda_k) & \text{if } k \leq K_s, \\ \sigma^2(1 + \sqrt{c})^2 & \text{if } k \in \{K_s + 1, \ldots, K\}. \end{cases}$$

We note that theorem 6 was already proved in the recent paper [7] using a different approach.
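The limit described by Theorem 6 can be written as a small function of the spike $\lambda_k$. The sketch below is illustrative only: the function names and the numerical values of $\sigma^2$ and $c$ are hypothetical, and the formula for $\psi$ is the one defined above. It also shows that $\psi$ matches the bulk edge $\sigma^2(1+\sqrt{c})^2$ exactly at the threshold $\lambda = \sigma^2\sqrt{c}$.

```python
import math

def psi(lam, sigma2, c):
    """psi(lambda) = (lambda + sigma^2 c)(lambda + sigma^2) / lambda."""
    return (lam + sigma2 * c) * (lam + sigma2) / lam

def spiked_limit(lam, sigma2, c):
    """Almost sure limit of the sample eigenvalue attached to a spike lam."""
    threshold = sigma2 * math.sqrt(c)
    if lam > threshold:
        return psi(lam, sigma2, c)              # spike detached from the bulk
    return sigma2 * (1.0 + math.sqrt(c)) ** 2   # spike absorbed by the bulk edge

sigma2, c = 1.0, 0.25                           # illustrative values
print(spiked_limit(2.0, sigma2, c))             # spike above the threshold 0.5
print(spiked_limit(0.3, sigma2, c))             # spike below the threshold
print(psi(sigma2 * math.sqrt(c), sigma2, c))    # psi at the threshold
```

With these values the threshold is $0.5$, and both the second and third lines print the bulk edge $2.25$, which reflects the continuity of the limit at $\lambda = \sigma^2\sqrt{c}$.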



4.1 Preliminary results on perturbed equations

We first state two useful lemmas related to the solutions of perturbed equations. They can be interpreted as extensions of lemmas 3.2 and 3.3 of [5]. In the following, we denote respectively by $\mathcal{D}(z, r)$, $\mathcal{D}_c(z, r)$ and $\mathcal{C}(z, r)$ the open disk, closed disk and circle of radius $r > 0$ with center $z$. Moreover, in this paragraph, the notation $o(1)$ denotes a term that converges towards $0$ when the variable $\epsilon$ converges towards $0$. The first result is a straightforward modification of [5, lemma 3.2]. Its proof is thus omitted.

Lemma 6. For each $\epsilon > 0$, we consider $h_\epsilon(z) = h(z) + \chi_\epsilon(z)$ with $h, \chi_\epsilon$ two holomorphic functions in a disk $\mathcal{D}(z_0, r_0)$. We assume that $\sup_{z \in \mathcal{D}(z_0, r_0)} |\chi_\epsilon(z)| = o(1)$. We consider $z_{0,\epsilon} = z_0 + \delta_\epsilon$ with $\delta_\epsilon = o(1)$.

Then, $\exists\, \epsilon_0 > 0$ and $r > 0$ such that for each $0 < \epsilon < \epsilon_0$, $z_{0,\epsilon} \in \mathcal{D}(z_0, r)$ and the equation

$$z - z_{0,\epsilon} - \epsilon h_\epsilon(z) = 0$$

admits a unique solution in $\mathcal{D}(z_0, r)$ given by

$$z_\epsilon = z_{0,\epsilon} + \epsilon h(z_0) + o(\epsilon).$$

Moreover, if we assume that $z_0 \in \mathbb{R}$, $h(z) \in \mathbb{R}$ for $z \in \mathbb{R}$, and that for $\epsilon$ small enough, $z_{0,\epsilon} \in \mathbb{R}$, $h_\epsilon(z) \in \mathbb{R}$ for $z \in \mathbb{R}$, then $z_\epsilon \in \mathbb{R}$.
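The first-order expansion of Lemma 6 can be observed on a toy example. In the sketch below, the choices $h = \cos$, $\chi_\epsilon(z) = \epsilon z$, $\delta_\epsilon = \epsilon$ and $z_0 = 1$ are hypothetical and serve only as an illustration; the equation is solved by a plain fixed-point iteration (a contraction for small $\epsilon$), and the gap to the prediction $z_{0,\epsilon} + \epsilon h(z_0)$ is reported relative to $\epsilon$.

```python
import math

def relative_gap(eps, z0=1.0, iters=100):
    """|z_eps - (z0_eps + eps*h(z0))| / eps for a toy instance of Lemma 6."""
    z0_eps = z0 + eps                            # delta_eps = eps = o(1)
    h_eps = lambda z: math.cos(z) + eps * z      # h = cos, chi_eps(z) = eps*z
    z = z0_eps
    for _ in range(iters):                       # fixed point of z = z0_eps + eps*h_eps(z)
        z = z0_eps + eps * h_eps(z)
    predicted = z0_eps + eps * math.cos(z0)      # first-order term of the lemma
    return abs(z - predicted) / eps

for eps in (1e-1, 1e-2, 1e-3):
    print(eps, relative_gap(eps))                # the ratio should shrink with eps
```

A ratio that tends to zero is what the $o(\epsilon)$ remainder in the lemma expresses.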

The second result is an extension of [5, Lem. 3.3] to certain third degree equations. The proof is given in Appendix C.

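Lemma 7. For each $\epsilon > 0$ and $i = 1, 2$, we consider $h_{i,\epsilon}(z) = h_i(z) + \chi_{i,\epsilon}(z)$ with $h_i, \chi_{i,\epsilon}$ holomorphic functions in a disk $\mathcal{D}(z_0, r_0)$. We assume that $h_1(z_0) \neq 0$ and that $\sup_{z \in \mathcal{D}(z_0, r_0)} |\chi_{i,\epsilon}(z)| = o(1)$ for $i = 1, 2$. We consider $z_{0,\epsilon} = z_0 + \delta_\epsilon$ with $\delta_\epsilon = o(1)$. Then, $\exists\, \epsilon_0 > 0$ and $r > 0$ such that $z_{0,\epsilon} \in \mathcal{D}(z_0, r)$ $\forall \epsilon \in (0, \epsilon_0)$ and the equation

$$(z - z_{0,\epsilon})^3 - \epsilon (z - z_{0,\epsilon}) h_{1,\epsilon}(z) + \epsilon^2 h_{2,\epsilon}(z) = 0$$

has 3 solutions in $\mathcal{D}(z_0, r)$ given by

$$z_\epsilon^- = z_{0,\epsilon} - \sqrt{\epsilon}\sqrt{h_1(z_0)} + o(\sqrt{\epsilon}),$$

$$z_\epsilon^+ = z_{0,\epsilon} + \sqrt{\epsilon}\sqrt{h_1(z_0)} + o(\sqrt{\epsilon}),$$

$$z_\epsilon = z_{0,\epsilon} + \epsilon \frac{h_2(z_0)}{h_1(z_0)} + o(\epsilon),$$

where $\sqrt{\cdot}$ is an arbitrary branch of the square root, analytic in a neighborhood of $h_1(z_0)$. Moreover, if we assume that $z_0 \in \mathbb{R}$, $h_i(z) \in \mathbb{R}$ for $z \in \mathbb{R}$ and that for $\epsilon$ small enough $z_{0,\epsilon} \in \mathbb{R}$, $h_{i,\epsilon}(z) \in \mathbb{R}$ for $z \in \mathbb{R}$, then $z_\epsilon$ is real. Moreover, if $h_1(z_0) > 0$ then $z_\epsilon^-$, $z_\epsilon^+$ and $z_\epsilon$ are real while $z_\epsilon^-$, $z_\epsilon^+$ are non real if $h_1(z_0) < 0$.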
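The three branches of Lemma 7 can be checked numerically in the simplest setting. The sketch below assumes constant functions $h_{1,\epsilon} \equiv h_1 > 0$ and $h_{2,\epsilon} \equiv h_2$ (hypothetical values, chosen for illustration) and works with $u = z - z_{0,\epsilon}$. Each root of the cubic is refined by Newton's method starting from the leading term predicted by the lemma, and the refined roots are compared with these predictions.

```python
import math

def cubic_roots(eps, h1, h2, iters=60):
    """Roots u = z - z0_eps of u^3 - eps*h1*u + eps^2*h2 = 0 (constant h1 > 0, h2)."""
    p = lambda u: u ** 3 - eps * h1 * u + eps ** 2 * h2
    dp = lambda u: 3.0 * u ** 2 - eps * h1
    # Leading terms given by Lemma 7: -sqrt(eps*h1), +sqrt(eps*h1), eps*h2/h1.
    guesses = (-math.sqrt(eps * h1), math.sqrt(eps * h1), eps * h2 / h1)
    roots = []
    for u in guesses:
        for _ in range(iters):          # Newton refinement from each prediction
            u -= p(u) / dp(u)
        roots.append(u)
    return guesses, roots

eps, h1, h2 = 1e-4, 2.0, 3.0            # illustrative values
guesses, roots = cubic_roots(eps, h1, h2)
for g, r in zip(guesses, roots):
    print(g, r)
# Errors relative to sqrt(eps) (outer roots) and to eps (inner root).
print(abs(roots[0] - guesses[0]) / math.sqrt(eps),
      abs(roots[1] - guesses[1]) / math.sqrt(eps),
      abs(roots[2] - guesses[2]) / eps)
```

The two outer roots sit at distance of order $\sqrt{\epsilon}$ from $z_{0,\epsilon}$ and the inner one at distance of order $\epsilon$, which is the separation of scales exploited in the proof.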
4.2 Characterization of $\mathcal{S}_N$ and limits of the eigenvalues if $\sigma^2\sqrt{c} \notin \{\lambda_1, \ldots, \lambda_K\}$

In this paragraph, we identify the clusters of the support $\mathcal{S}_N$, and evaluate the points $x_{q,N}^-, x_{q,N}^+$ for $q = 1, \ldots, Q_N$. From the theorem characterizing the support, these points coincide with the positive extrema of function $\phi_N$ (defined in (7)), and possibly $x_{1,N}^- = 0$ if $c_N = 1$. Therefore, we first evaluate the real zeros of $\phi_N'(w) = (1 - \sigma^2 c_N f_N(w))^2 - 2\sigma^2 c_N w f_N'(w)(1 - \sigma^2 c_N f_N(w)) - \sigma^4 c_N (1 - c_N) f_N'(w)$. Straightforward calculations give



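$$\phi_N'(w) = \frac{\gamma_{1,N}(w) + \frac{1}{M}\gamma_{2,N}(w) + \frac{1}{M^2}\gamma_{3,N}(w)}{w^2 \prod_{k=1}^{K}(\lambda_{k,N} - w)^3}$$

with

$$\gamma_{1,N}(w) = (w^2 - \sigma^4 c_N) \prod_{k=1}^{K}(\lambda_{k,N} - w)^3,$$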

K K 

T2,n(w) = -2a 2 c N Yl (A k>N - u>) £ 

k=l j=l 



2 2 a 2 {\ + c N )Xi N ' 
w +a (\ + Cn)w 

2 



1=1 



K K 

E X k,N EI (A '.W " w ) 
fc=l Z=l 



if if 

£ A fc;iv (3w- X kiN ) Y\(A.i, N - wf 
k=l 1=1 
Ijtk 



Therefore, $\phi_N'(w) = 0$ if and only if

$$\gamma_{1,N}(w) + \frac{1}{M}\gamma_{2,N}(w) + \frac{1}{M^2}\gamma_{3,N}(w) = 0. \qquad (20)$$



We assume $c < 1$, which implies that $c_N < 1$ for $N$ large enough. The calculations are essentially the same if $c = 1$. We first observe that the zeros of $\phi_N'$ are included into a compact interval $\mathcal{I}$ independent of $N$ (see the proof of Corollary 1). Next, we claim that for each $\alpha > 0$, there exist $\beta > 0$ and $N_0 \in \mathbb{N}$ such that

$$\left|\gamma_{1,N}(w) + \frac{1}{M}\gamma_{2,N}(w) + \frac{1}{M^2}\gamma_{3,N}(w)\right| > \beta$$

if $N > N_0$ and $|w - \sigma^2\sqrt{c}| > \alpha$, $|w + \sigma^2\sqrt{c}| > \alpha$, $|w - \lambda_k| > \alpha$, $k = 1, \ldots, K$ and $w \in \mathcal{I}$. This follows immediately from the inequality

$$\left|\gamma_{1,N}(w) + \frac{1}{M}\gamma_{2,N}(w) + \frac{1}{M^2}\gamma_{3,N}(w)\right| \geq |\gamma_{1,N}(w)| - \frac{1}{M}\gamma_{2,\max} - \frac{1}{M^2}\gamma_{3,\max},$$

where $\gamma_{j,\max} = \max_{w \in \mathcal{I}} |\gamma_{j,N}(w)|$ for $j = 2, 3$. This shows that the solutions of eq. (20) are located around the points $\sigma^2\sqrt{c}$, $-\sigma^2\sqrt{c}$, $\lambda_k$, $k = 1, \ldots, K$.
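In a disk $\mathcal{D}(\sigma^2\sqrt{c}, r)$, (20) is equivalent to

$$w - \sigma^2\sqrt{c_N} + \frac{1}{M}\,\frac{w - \sigma^2\sqrt{c_N}}{\gamma_{1,N}(w)}\left[\gamma_{2,N}(w) + \frac{1}{M}\gamma_{3,N}(w)\right] = 0. \qquad (21)$$

We use lemma 6 with $\epsilon = M^{-1}$, $z_0 = \sigma^2\sqrt{c}$, $z_{0,\epsilon} = \sigma^2\sqrt{c_N}$, and the functions

$$h_\epsilon(w) = -\frac{w - \sigma^2\sqrt{c_N}}{\gamma_{1,N}(w)}\left[\gamma_{2,N}(w) + \frac{1}{M}\gamma_{3,N}(w)\right]$$

and $h(w) = \lim_{M \to +\infty} h_\epsilon(w)$. $h(w)$ is obtained by replacing $c_N$ and the $(\lambda_{k,N})_{k=1,\ldots,K}$ by $c$ and the $(\lambda_k)_{k=1,\ldots,K}$ in the expression of $h_\epsilon$. Lemma 6 implies that there exists $r$ for which equation (21), or equivalently equation (20), has a unique solution in $\mathcal{D}(\sigma^2\sqrt{c}, r)$ for $M$ large enough. This solution is given by $\sigma^2\sqrt{c_N} + O\left(\frac{1}{M}\right)$. It is easy to check that

$$\phi_N\left(\sigma^2\sqrt{c_N} + O(M^{-1})\right) = \sigma^2(1 + \sqrt{c_N})^2 + O\left(\frac{1}{M}\right).$$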
This quantity is positive, thus showing that $\sigma^2\sqrt{c_N} + O(M^{-1})$ is the pre-image of a positive extremum of $\phi_N$. Exchanging $\sigma^2\sqrt{c}$ with $-\sigma^2\sqrt{c}$, we obtain similarly that there exists a neighborhood of $-\sigma^2\sqrt{c}$ in which equation (20) has a unique solution given by $-\sigma^2\sqrt{c_N} + O\left(\frac{1}{M}\right)$. Moreover,

$$\phi_N\left(-\sigma^2\sqrt{c_N} + O(M^{-1})\right) = \sigma^2(1 - \sqrt{c_N})^2 + O\left(\frac{1}{M}\right),$$

so that $-\sigma^2\sqrt{c_N} + O\left(\frac{1}{M}\right)$ is also the pre-image of a positive extremum of $\phi_N$.

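We now consider $i \in \{1, \ldots, K\}$, and study the equation (20) in a neighborhood $\mathcal{D}(\lambda_i, r)$ of $\lambda_i$. In order to use lemma 7, we put $\epsilon = \frac{1}{M}$, $z_0 = \lambda_i$, $z_{0,\epsilon} = \lambda_{i,N}$. It is easily seen that in $\mathcal{D}(\lambda_i, r)$, eq. (20) is equivalent to

$$(w - \lambda_{i,N})^3 - \frac{1}{M}(w - \lambda_{i,N}) h_{1,\epsilon}(w) + \frac{1}{M^2} h_{2,\epsilon}(w) = 0,$$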



where 



hl,e(w) = 

h 2 Aw) = 



Afc_JV ( if 2 + (T 2 (l + Cjv) If - 



(Tt.l+C N )Xk,N 



If)" 



(lf 2 -(7 4 CAf)nfc =1 (Afc,iv- If) 2 
k+i 



{W 2 -^C N )U^iUk,N-W) 3 



We denote by $h_1(w)$ and $h_2(w)$ the limits of $h_{1,\epsilon}(w)$ and $h_{2,\epsilon}(w)$ when $\epsilon \to 0$, i.e. the functions obtained by replacing $c_N$ and the $(\lambda_{k,N})_{k=1,\ldots,K}$ by $c$ and the $(\lambda_k)_{k=1,\ldots,K}$ respectively in the expressions of $h_{1,\epsilon}$, $h_{2,\epsilon}$. After some algebra, we obtain that



2o- z cAf(A,- + 2-ii±£l) 



while h 2 {Ai) is equal to 



2ct 4 c 2 A? 



Lemma[7]implies that it exists r such that 



H.N " 



1 a cXi 



M A; + a 2 lM 



+ o 



is solution of (20) contained in @ (A,, r). It is however easy to check that 



A, 



N ' 



1 a cXj 
MXi + a 2 ^ 



(T 4 (l-C) 2 

2A~ 



(!-I)<0. 



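Therefore, the above extremum is negative, and its pre-image cannot be one of the points $w_{q,N}^-$, $w_{q,N}^+$. Moreover, if $\lambda_i < \sigma^2\sqrt{c}$, then $h_1(\lambda_i) < 0$ and (20) has no extra real solution in $\mathcal{D}(\lambda_i, r)$. If $\lambda_i > \sigma^2\sqrt{c}$, then $h_1(\lambda_i) > 0$, and the quantities

$$\lambda_{i,N} - \frac{1}{\sqrt{M}}\sqrt{h_1(\lambda_i)} + o\left(\frac{1}{\sqrt{M}}\right) \quad \text{and} \quad \lambda_{i,N} + \frac{1}{\sqrt{M}}\sqrt{h_1(\lambda_i)} + o\left(\frac{1}{\sqrt{M}}\right)$$

are the 2 other real solutions of (20) contained in $\mathcal{D}(\lambda_i, r)$. After some algebra, we get that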



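$$\phi_N\left(\lambda_{i,N} - \frac{1}{\sqrt{M}}\sqrt{h_1(\lambda_i)} + o\left(\frac{1}{\sqrt{M}}\right)\right) = \frac{(\lambda_{i,N} + \sigma^2 c_N)(\lambda_{i,N} + \sigma^2)}{\lambda_{i,N}} - \frac{1}{\sqrt{M}}\,\frac{2\sqrt{h_1(\lambda_i)}\,(\lambda_i^2 - \sigma^4 c)}{\lambda_i^2} + o\left(\frac{1}{\sqrt{M}}\right),$$

$$\phi_N\left(\lambda_{i,N} + \frac{1}{\sqrt{M}}\sqrt{h_1(\lambda_i)} + o\left(\frac{1}{\sqrt{M}}\right)\right) = \frac{(\lambda_{i,N} + \sigma^2 c_N)(\lambda_{i,N} + \sigma^2)}{\lambda_{i,N}} + \frac{1}{\sqrt{M}}\,\frac{2\sqrt{h_1(\lambda_i)}\,(\lambda_i^2 - \sigma^4 c)}{\lambda_i^2} + o\left(\frac{1}{\sqrt{M}}\right)$$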
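are both positive. It is easy to check that if $k \leq K_s$, then $\sigma^2\sqrt{c_N} < \lambda_{k,N}$ for $N$ large enough. The above discussion thus implies that $\mathcal{S}_N$ has $K_s + 1$ clusters, and that for $k \leq K_s$, then

$$x_{1,N}^- = \sigma^2(1 - \sqrt{c_N})^2 + O\left(\frac{1}{M}\right), \qquad x_{1,N}^+ = \sigma^2(1 + \sqrt{c_N})^2 + O\left(\frac{1}{M}\right),$$

$$x_{K_s - k + 2,N}^- = \frac{(\lambda_{k,N} + \sigma^2 c_N)(\lambda_{k,N} + \sigma^2)}{\lambda_{k,N}} - \frac{1}{\sqrt{M}}\,\frac{2\sqrt{h_1(\lambda_k)}\,(\lambda_k^2 - \sigma^4 c)}{\lambda_k^2} + o\left(\frac{1}{\sqrt{M}}\right),$$

$$x_{K_s - k + 2,N}^+ = \frac{(\lambda_{k,N} + \sigma^2 c_N)(\lambda_{k,N} + \sigma^2)}{\lambda_{k,N}} + \frac{1}{\sqrt{M}}\,\frac{2\sqrt{h_1(\lambda_k)}\,(\lambda_k^2 - \sigma^4 c)}{\lambda_k^2} + o\left(\frac{1}{\sqrt{M}}\right).$$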

In order to complete the proof, we use theorem 5. Let $k \in \{1, \ldots, K_s\}$. From the previous analysis, the eigenvalue $\lambda_{k,N}$ is the unique eigenvalue of $\mathbf{B}_N\mathbf{B}_N^*$ associated with interval $(w_{q,N}^-, w_{q,N}^+)$ with $q = K_s - k + 2$, for $N$ large enough. Moreover, the number of clusters of $\mathcal{S}_N$ is equal to $K_s + 1$ for $N$ large enough and the sequences $x_{q,N}^-$ and $x_{q,N}^+$ converge towards limits equal to $\sigma^2(1 - \sqrt{c})^2$ and $\sigma^2(1 + \sqrt{c})^2$ for $q = 1$, and both coincide with $\psi(\lambda_{K_s + 2 - q})$ for $q \geq 2$. This implies that for each $\epsilon > 0$, almost surely for $N$ large enough, $\hat\lambda_{k,N} \in (\psi(\lambda_k) - \epsilon, \psi(\lambda_k) + \epsilon)$ for $k = 1, \ldots, K_s$ and that $\hat\lambda_{k,N} \in (\sigma^2(1 - \sqrt{c})^2 - \epsilon, \sigma^2(1 + \sqrt{c})^2 + \epsilon)$ for $k > K_s$. This shows that $\hat\lambda_{k,N} \to \psi(\lambda_k)$ for $k = 1, \ldots, K_s$.

We now prove the convergence of $\hat\lambda_{k,N}$ to $\sigma^2(1 + \sqrt{c})^2$ for $K_s < k \leq K$. Let $k_{\max} = K_s + 1$ (i.e. the index of the largest eigenvalue associated with the first cluster $[x_{1,N}^-, x_{1,N}^+]$). We have already shown $\limsup_N \hat\lambda_{k_{\max},N} \leq \sigma^2(1 + \sqrt{c})^2$ almost surely. It remains to prove $\liminf_N \hat\lambda_{k_{\max},N} \geq \sigma^2(1 + \sqrt{c})^2$. Assume the converse is true. Then there exists $\epsilon > 0$ such that $\liminf_N \hat\lambda_{k_{\max},N} < \sigma^2(1 + \sqrt{c})^2 - \epsilon$. We can thus extract a subsequence $\hat\lambda_{k_{\max},\phi(N)}$ converging towards a limit less than $\sigma^2(1 + \sqrt{c})^2 - \epsilon$. Let $\hat\mu_{\phi(N)}$ be the empirical spectral measure associated with matrix $\boldsymbol{\Sigma}_{\phi(N)}\boldsymbol{\Sigma}_{\phi(N)}^*$. We deduce that

$$\hat\mu_{\phi(N)}\left((\sigma^2(1 + \sqrt{c})^2 - \epsilon, \sigma^2(1 + \sqrt{c})^2]\right) = 0 \quad \text{a.s. for all large } N. \qquad (22)$$

Theorem 1 implies that $\hat\mu_{\phi(N)}$ converges towards the Marcenko-Pastur distribution, which contradicts (22). This proves that $\hat\lambda_{k_{\max},N} \to \sigma^2(1 + \sqrt{c})^2$ with probability one. We can prove similarly that $\hat\lambda_{k,N} \to \sigma^2(1 + \sqrt{c})^2$ a.s. for $K_s + 1 < k \leq K$.
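The convergence of the largest sample eigenvalue towards $\psi(\lambda_1)$ can be illustrated by a small Monte Carlo experiment. The sketch below is only indicative: it assumes a rank-one $\mathbf{B}_N = \sqrt{\lambda}\, e_1 e_1^*$, entries of $\mathbf{W}_N$ with variance $1/N$, modest dimensions so that it runs with the standard library alone, and a power iteration to approximate the top eigenvalue of $\boldsymbol{\Sigma}_N\boldsymbol{\Sigma}_N^*$. The function name, seed and dimensions are hypothetical, and at this size finite-sample fluctuations remain visible.

```python
import math
import random

def top_eigenvalue_spiked(M, N, lam, sigma2, seed=0, iters=100):
    """Largest eigenvalue of Sigma Sigma^*, Sigma = B + sigma*W, with rank-one B."""
    rng = random.Random(seed)
    s = math.sqrt(sigma2 / (2.0 * N))   # std of real and imaginary parts of sigma*W
    Z = [[complex(rng.gauss(0.0, s), rng.gauss(0.0, s)) for _ in range(N)]
         for _ in range(M)]
    Z[0][0] += math.sqrt(lam)           # B = sqrt(lam) e_1 e_1^*, so B B^* has one spike lam
    v = [complex(1.0, 0.0)] * M
    estimate = 0.0
    for _ in range(iters):              # power iteration: v <- Z (Z^* v)
        norm = math.sqrt(sum(abs(x) ** 2 for x in v))
        v = [x / norm for x in v]
        y = [sum(Z[i][j].conjugate() * v[i] for i in range(M)) for j in range(N)]
        estimate = sum(abs(t) ** 2 for t in y)   # Rayleigh quotient v^* Z Z^* v
        v = [sum(Z[i][j] * y[j] for j in range(N)) for i in range(M)]
    return estimate

M, N, lam, sigma2 = 30, 120, 4.0, 1.0   # c = M/N = 0.25, threshold sigma^2*sqrt(c) = 0.5
c = M / N
psi_value = (lam + sigma2 * c) * (lam + sigma2) / lam
bulk_edge = sigma2 * (1.0 + math.sqrt(c)) ** 2
estimate = top_eigenvalue_spiked(M, N, lam, sigma2)
print(estimate, psi_value, bulk_edge)
```

Since $\lambda = 4$ is above the threshold, the estimate is expected to lie near $\psi(4) = 5.3125$ and well above the bulk edge $2.25$; larger dimensions should tighten the agreement.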

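4.3 Characterization of $\mathcal{S}_N$ and limits of the eigenvalues if $\sigma^2\sqrt{c} \in \{\lambda_1, \ldots, \lambda_K\}$

In this section, we handle the case where one of the $(\lambda_k)_{k=1,\ldots,K}$, say $\lambda_j$ with $j \leq K$, is equal to $\sigma^2\sqrt{c}$. For this, we will use the Fan inequality (see [15, Th.2]). For a rectangular matrix $\mathbf{A}$, we will denote by $\kappa_k(\mathbf{A})$ its $k$-th largest singular value. With this notation, we have $\kappa_j(\mathbf{B}) = \sqrt{\lambda_j} = \sqrt{\sigma^2\sqrt{c}}$. We also denote by $\mathbf{u}_{j,N}$ and $\mathbf{v}_{j,N}$ the left and right singular vectors of $\mathbf{B}_N$ associated with $\kappa_j(\mathbf{B}_N)$. Fan inequality gives, for $\epsilon > 0$,

$$\kappa_j(\mathbf{B}_N + \sigma\mathbf{W}_N) \leq \kappa_j(\mathbf{B}_N + \sigma\mathbf{W}_N + \epsilon\mathbf{u}_{j,N}\mathbf{v}_{j,N}^*) + \kappa_1(\epsilon\mathbf{u}_{j,N}\mathbf{v}_{j,N}^*),$$

$$\kappa_j(\mathbf{B}_N + \sigma\mathbf{W}_N + \epsilon\mathbf{u}_{j,N}\mathbf{v}_{j,N}^*) \leq \kappa_j(\mathbf{B}_N + \sigma\mathbf{W}_N) + \kappa_1(\epsilon\mathbf{u}_{j,N}\mathbf{v}_{j,N}^*).$$

From the results of the previous section, it is clear that, almost surely,

$$\kappa_j(\mathbf{B}_N + \sigma\mathbf{W}_N + \epsilon\mathbf{u}_{j,N}\mathbf{v}_{j,N}^*) = \sqrt{\psi\left((\sqrt{\lambda_j} + \epsilon)^2\right)} + o(1).$$

Therefore, we end up with

$$\sqrt{\psi\left((\sqrt{\lambda_j} + \epsilon)^2\right)} - \epsilon \leq \liminf_N \kappa_j(\mathbf{B}_N + \sigma\mathbf{W}_N) \leq \limsup_N \kappa_j(\mathbf{B}_N + \sigma\mathbf{W}_N) \leq \sqrt{\psi\left((\sqrt{\lambda_j} + \epsilon)^2\right)} + \epsilon.$$

Since $\psi(\lambda) \to \sigma^2(1 + \sqrt{c})^2$ when $\lambda \to \sigma^2\sqrt{c}$, this completes the proof of theorem 6.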
A Proof of item 2 of theorem 2

The proof is a direct consequence of Section 2 of [13] (formulas (2.1) to (2.5) of [13]). We first recall that if we denote by $g(z)$ and $G(z)$ the terms defined for $z \in \mathbb{C}^+$ by

|1 + a d cm Wjv(z) I M 

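$$G(z) = \sigma^2 c_N \frac{1}{M}\mathrm{Tr}\,\mathbf{T}_N(z)\mathbf{T}_N(z)^*,$$

where $\mathbf{T}_N(z) = \left[-z(1 + \sigma^2 c_N m_N(z))\mathbf{I}_M + \sigma^2(1 - c_N)\mathbf{I}_M + \frac{\mathbf{B}_N\mathbf{B}_N^*}{1 + \sigma^2 c_N m_N(z)}\right]^{-1}$, then, it is shown in [13] that

$$0 < |z| G(z) < 1 - g(z) \qquad (23)$$

for each $z \in \mathbb{C}^+$. If $z_1 = \mathrm{Re}(z)$ and $z_2 = \mathrm{Im}(z)$, (23) implies that $0 < 1 - g(z) - |z_1| G(z) \leq 1 - g(z) + z_1 G(z)$. It is shown in [13] that

$$\mathrm{Re}(1 + \sigma^2 c_N m_N(z)) = \frac{1 + \sigma^2(1 - c_N) G(z) + \mathrm{Im}(1 + \sigma^2 c_N m_N(z))\, z_2 G(z)}{1 - g(z) + z_1 G(z)},$$

for $z \in \mathbb{C}^+$. As $\mathrm{Im}(1 + \sigma^2 c_N m_N(z)) = \sigma^2 c_N \mathrm{Im}(m_N(z)) > 0$ on $\mathbb{C}^+$ (see item 3 of Property 1), we get that

$$\mathrm{Re}(1 + \sigma^2 c_N m_N(z)) > \frac{1}{1 - g(z) + z_1 G(z)} > \frac{1}{1 + z_1 G(z)}.$$

The inequality $|z_1| G(z) < 1$ implies that $\mathrm{Re}(1 + \sigma^2 c_N m_N(z)) > \frac{1}{2}$ for each $z \in \mathbb{C}^+$. This also implies that $\mathrm{Re}(1 + \sigma^2 c_N m_N(x)) > \frac{1}{2}$ for $x \in \mathbb{R}$ if $c_N < 1$ and for $x \in \mathbb{R}^*$ if $c_N = 1$.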

B Proof of items 6 and 8 of lemma 2 when q = 1

In order to prove these 2 statements, we study the behaviour of $w_N(x)$ and of $w_N'(x)$ when $x \to 0$, $x < 0$ and $x \to 0$, $x > 0$.

We first look at the limit for $x < 0$. Lemmas 1 and 2 imply that $w_N$ is the inverse of $\phi_N$ on interval $(-\infty, 0)$. $w_N(x)$ is a continuous increasing function on $(-\infty, 0)$ upperbounded by $w_{1,N}^-$; therefore, $\lim_{x \to 0, x < 0} w_N(x)$ exists, and is less than $w_{1,N}^-$. Taking the limit when $x \to 0$, $x < 0$ from both sides of the equation $\phi_N(w_N(x)) = x$ for $x \in (-\infty, 0)$, and using the continuity of $\phi_N$ on $(-\infty, 0]$, we obtain immediately that $\phi_N(\lim_{x \to 0, x < 0} w_N(x)) = 0$. This implies that $\lim_{x \to 0, x < 0} w_N(x) = w_{1,N}^-$. This shows that $w_N$ is left continuous at $x = 0$. Since $w_N(x) = x(1 + \sigma^2 m_N(x))^2$, it follows that $1 + \sigma^2 m_N(x) = O(|x|^{-1/2})$. As $w_N$ is continuously differentiable on $(-\infty, 0)$, we can differentiate the relation $\phi_N(w_N(x)) = x$, and obtain that $\phi_N'(w_N(x))\, w_N'(x) = 1$ for $x < 0$, or equivalently that $w_N'(x) = \frac{1}{\phi_N'(w_N(x))}$. In other words, it holds that

$$w_N'(x) = \frac{1}{\left[1 - \sigma^2 f_N(w_N(x))\right]\left[1 - \sigma^2 f_N(w_N(x)) - 2\sigma^2 w_N(x) f_N'(w_N(x))\right]}. \qquad (24)$$

We observe that $1 - \sigma^2 f_N(w_{1,N}^-) = 0$ so that

$$\lim_{x \to 0, x < 0}\left(1 - \sigma^2 f_N(w_N(x)) - 2\sigma^2 w_N(x) f_N'(w_N(x))\right) = -2\sigma^2 w_{1,N}^- f_N'(w_{1,N}^-) \neq 0. \qquad (25)$$

Moreover, (10) implies that

$$\frac{1}{1 - \sigma^2 f_N(w_N(x))} = 1 + \sigma^2 m_N(x) \quad \text{for } x < 0,$$

which proves that $(1 - \sigma^2 f_N(w_N(x)))^{-1} = O((-x)^{-1/2})$. (24) implies immediately that $w_N'(x) = O\left(\frac{1}{\sqrt{-x}}\right)$.
We now study the behaviour of $w_N$ and $w_N'$ when $x \to 0$, $x > 0$. We first study $\sqrt{x}\, m_N(x)$ for $x \to 0$, $x > 0$. For this, we introduce the function $\psi(\xi, y)$ defined by

y( f ,y) = 1 -lT 3c (B^^-f(y + ^o) 

The introduction of $\psi$ is based on the observation that eq. (6) is equivalent to $\psi(\sqrt{x}\, m_N(x), \sqrt{x}) = 0$ for $x > 0$. We denote by $\xi_0$ the term $\xi_0 = i\sigma^{-2}|w_{1,N}^-|^{1/2}$ and notice that $\psi(\xi_0, 0) = 0$. It is easily checked that $\psi$ is holomorphic in a neighborhood of $(\xi_0, 0)$ and that $\frac{\partial\psi}{\partial\xi}(\xi_0, 0) \neq 0$. Therefore, from the implicit function theorem (the analytic version, see e.g. Cartan [11, Prop.6]), there exists a unique function $\xi(y)$, holomorphic in a neighborhood $\mathcal{V}$ of $0$ satisfying $\psi(\xi(y), y) = 0$ for $y \in \mathcal{V}$ and $\xi(0) = \xi_0$. As $\mathrm{Im}(\xi_0) > 0$, it is clear that there exists a neighborhood $\mathcal{V}'$ of $0$ included in $\mathcal{V}$ such that $\mathrm{Im}(\xi(y)) > 0$ for each $y \in \mathcal{V}'$. We claim that for $\sqrt{x} \in \mathcal{V}' \cap \mathbb{R}^{+*}$, $\xi(\sqrt{x}) = \sqrt{x}\, m_N(x)$. For this, we notice that if $x \in (0, x_{1,N}^+)$, $m_N(x)$ is the unique solution of Eq. (6) for which $\mathrm{Im}(m_N(x)) > 0$. Indeed, from item 5 of lemma 2, for $x \in (0, x_{1,N}^+)$, $w_N(x)$ is the unique solution with positive imaginary part of equation $\phi_N(w) = x$. But, $m_N(x)$ is solution of (6) iff $w_N(x)$ is solution of $\phi_N(w) = x$. Moreover $m_N(x) \in \mathbb{C}^+$ iff $w_N(x) \in \mathbb{C}^+$, a property which is readily seen from the relation (8). The conclusion follows from the observation that $m_N(x)$ satisfies (6) iff $\sqrt{x}\, m_N(x)$ satisfies $\psi(\sqrt{x}\, m_N(x), \sqrt{x}) = 0$. This in turn shows that for each $\sqrt{x} \in \mathcal{V}' \cap \mathbb{R}^{+*}$, $\xi(\sqrt{x}) = \sqrt{x}\, m_N(x)$, or equivalently that $\xi(y) = y\, m_N(y^2)$ for $y \in \mathcal{V}' \cap \mathbb{R}^{+*}$. As $\xi(y)$ is holomorphic in $\mathcal{V}'$, $\xi(y) = \xi_0 + o(1)$ and $\xi'(y) = \xi_1 + o(1)$ for some coefficient $\xi_1$. Therefore, $y\, m_N(y^2) = \xi_0 + o(1)$ and $2y^2 m_N'(y^2) + m_N(y^2) = \xi_1 + o(1)$ for $y \in \mathcal{V}' \cap \mathbb{R}^{+*}$, or equivalently $\sqrt{x}\, m_N(x) = \xi_0 + o(1)$ and $2x\, m_N'(x) + m_N(x) = \xi_1 + o(1)$ for $x > 0$ small enough. As $w_N(x) = x(1 + \sigma^2 m_N(x))^2$, we get that

$$w_N'(x) = (1 + \sigma^2 m_N(x))\left[1 + \sigma^2\left(m_N(x) + 2x\, m_N'(x)\right)\right].$$

As $(m_N(x) + 2x\, m_N'(x))$ is a $O(1)$ term, and as $m_N(x) = \frac{\xi_0}{\sqrt{x}} + o\left(\frac{1}{\sqrt{x}}\right)$, we obtain that $|w_N'(x)| \leq \frac{C}{\sqrt{x}}$ for $x > 0$ small enough for some constant $C > 0$.



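C Proof of lemma 7

We begin by choosing $r > 0$ and $\epsilon_1 > 0$ such that $r < r_0$, $z_{0,\epsilon} \in \mathcal{D}_c(z_0, r)$ and $\mathcal{D}_c(z_{0,\epsilon}, r) \subset \mathcal{D}(z_0, r_0)$, for each $0 < \epsilon < \epsilon_1$. Let $f_\epsilon(z) = (z - z_{0,\epsilon})^3 - \epsilon(z - z_{0,\epsilon}) h_{1,\epsilon}(z) + \epsilon^2 h_{2,\epsilon}(z)$ and $g_\epsilon(z) = (z - z_{0,\epsilon})^3$. Moreover,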
define §■ = sup @c(zo r) |^, (z)| (for i = 1,2). 

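As $\sup_{z \in \mathcal{D}(z_0, r_0)} |\chi_{i,\epsilon}(z)| = o(1)$, there exists $\epsilon_2 < \epsilon_1$ such that $\sup_{\mathcal{D}_c(z_0, r)} |h_{i,\epsilon}(z)| < \mathcal{K}_i$ (for $i = 1, 2$) for each $\epsilon < \epsilon_2$. For $z \in \mathcal{D}_c(z_0, r)$, it holds that

$$|f_\epsilon(z) - g_\epsilon(z)| \leq \epsilon |z - z_{0,\epsilon}|\, |h_{1,\epsilon}(z)| + \epsilon^2 |h_{2,\epsilon}(z)|.$$

As $z_{0,\epsilon} - z_0 = o(1)$, there exists $\epsilon_3 < \epsilon_2$ such that, for each $\epsilon < \epsilon_3$, $|z - z_{0,\epsilon}| < 2r$ on $\mathcal{D}_c(z_0, r)$. Hence, for each $\epsilon < \epsilon_3$, it holds that $|f_\epsilon(z) - g_\epsilon(z)| < 2\epsilon r \mathcal{K}_1 + \epsilon^2 \mathcal{K}_2$ on $\mathcal{D}_c(z_0, r)$. We now restrict $z$ to $\mathcal{C}(z_0, r)$, the boundary of $\mathcal{D}_c(z_0, r)$. There exists $\epsilon_4 < \epsilon_3$ for which $2\epsilon r \mathcal{K}_1 + \epsilon^2 \mathcal{K}_2 < |z - z_{0,\epsilon}|^3$ holds on $\mathcal{C}(z_0, r)$ for each $\epsilon < \epsilon_4$. Therefore, $\forall z \in \mathcal{C}(z_0, r)$, we have $|f_\epsilon(z) - g_\epsilon(z)| < |g_\epsilon(z)|$ for $\epsilon < \epsilon_4$. It follows from Rouché's theorem that for these values of $\epsilon$, $f_\epsilon$ and $g_\epsilon$ have the same number of zeros inside $\mathcal{D}(z_0, r)$. Thus, for $\epsilon < \epsilon_4$, the equation

$$(z - z_{0,\epsilon})^3 - \epsilon(z - z_{0,\epsilon}) h_{1,\epsilon}(z) + \epsilon^2 h_{2,\epsilon}(z) = 0 \qquad (26)$$

has three solutions in $\mathcal{D}(z_0, r)$. Applying the same procedure to functions $f_\epsilon(z) = (z - z_{0,\epsilon})^2 - \epsilon h_{1,\epsilon}(z)$ and $g_\epsilon(z) = (z - z_{0,\epsilon})^2$, we deduce that if $\epsilon < \epsilon_5 < \epsilon_4$, the equation

$$(z - z_{0,\epsilon})^2 - \epsilon h_{1,\epsilon}(z) = 0 \qquad (27)$$

has two solutions in $\mathcal{D}(z_0, r)$, denoted here $\tilde z_\epsilon^-, \tilde z_\epsilon^+$. We clearly have $|z_{0,\epsilon} - \tilde z_\epsilon^-| = O(\epsilon^{1/2})$ and $|z_0 - \tilde z_\epsilon^-| = o(1)$. Therefore, $h_{1,\epsilon}(\tilde z_\epsilon^-) - h_1(z_0) = o(1)$. As $h_1(z_0) \neq 0$, there exist $\epsilon_6 < \epsilon_5$ and a neighborhood of $h_1(z_0)$, containing $h_{1,\epsilon}(\tilde z_\epsilon^-)$, $h_{1,\epsilon}(z_0)$ for each $\epsilon < \epsilon_6$, in which a suitable branch of the square root $\sqrt{\cdot}$ is analytic. We assume that solution $\tilde z_\epsilon^-$ is given by $z_{0,\epsilon} - \tilde z_\epsilon^- = -\sqrt{\epsilon}\sqrt{h_{1,\epsilon}(\tilde z_\epsilon^-)}$. As $|h_1(z_0) - h_{1,\epsilon}(\tilde z_\epsilon^-)| = o(1)$, we have $z_{0,\epsilon} - \tilde z_\epsilon^- = -\sqrt{\epsilon}\sqrt{h_1(z_0)} + o(\sqrt{\epsilon})$. We obtain similarly that $z_{0,\epsilon} - \tilde z_\epsilon^+ = \sqrt{\epsilon}\sqrt{h_1(z_0)} + o(\sqrt{\epsilon})$.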
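Considering again the solution $\tilde z_\epsilon^-$ of (27), it follows that there exists $\epsilon_7 < \epsilon_6$ such that for each $\epsilon < \epsilon_7$, it holds that

$$|z_{0,\epsilon} - \tilde z_\epsilon^-| > \frac{\sqrt{\epsilon}\sqrt{|h_1(z_0)|}}{2} > \sqrt{\epsilon r'}, \qquad (28)$$

with $r' < \frac{|h_1(z_0)|}{4}$. For $\epsilon < \epsilon_8 < \epsilon_7$, we have $\sqrt{\epsilon r'} < r$ and for $z \in \mathcal{D}_c(z_{0,\epsilon}, \sqrt{\epsilon r'})$, we get

$$\left|(z - z_{0,\epsilon})^2 - \epsilon h_{1,\epsilon}(z)\right| \geq \epsilon |h_{1,\epsilon}(z)| - |z - z_{0,\epsilon}|^2 \geq \epsilon\left(|h_{1,\epsilon}(z)| - r'\right).$$

It is easy to check that for each $\epsilon < \epsilon_9 < \epsilon_8$, then $|h_{1,\epsilon}(z)| > \frac{|h_1(z_0)|}{2}$ for $z \in \mathcal{D}_c(z_{0,\epsilon}, \sqrt{\epsilon r'})$. Therefore,

$$\left|(z - z_{0,\epsilon})^2 - \epsilon h_{1,\epsilon}(z)\right| > \epsilon\left(\frac{|h_1(z_0)|}{2} - r'\right) > \epsilon r'. \qquad (29)$$

The inequalities (28) and (29) prove that in $\mathcal{D}_c(z_{0,\epsilon}, \sqrt{\epsilon r'})$, the equation (27) has no solution and that the equation $(z - z_{0,\epsilon})^3 - \epsilon(z - z_{0,\epsilon}) h_{1,\epsilon}(z) = 0$ has only one solution there.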

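We now study the number of solutions in $\mathcal{D}_c(z_{0,\epsilon}, \sqrt{\epsilon r'})$ of the equation (26). Consider

$$f_\epsilon(z) = (z - z_{0,\epsilon})^3 - \epsilon(z - z_{0,\epsilon}) h_{1,\epsilon}(z) + \epsilon^2 h_{2,\epsilon}(z),$$

$$g_\epsilon(z) = (z - z_{0,\epsilon})^3 - \epsilon(z - z_{0,\epsilon}) h_{1,\epsilon}(z).$$

We have $|f_\epsilon(z) - g_\epsilon(z)| = \epsilon^2 |h_{2,\epsilon}(z)|$. We consider $z \in \mathcal{C}(z_{0,\epsilon}, \sqrt{\epsilon r'})$. From (29), $|g_\epsilon(z)| > (\epsilon r')^{3/2}$. Therefore, for each $\epsilon < \epsilon_{10} < \epsilon_9$, it holds that $|g_\epsilon(z)| > \epsilon^2 |h_{2,\epsilon}(z)| = |f_\epsilon(z) - g_\epsilon(z)|$. Thus, from Rouché's theorem, the equation (26) has only one solution in $\mathcal{D}(z_{0,\epsilon}, \sqrt{\epsilon r'})$, denoted by $z_\epsilon$. To obtain $z_\epsilon$, we write

$$z_\epsilon - z_{0,\epsilon} = \frac{-\epsilon^2 h_{2,\epsilon}(z_\epsilon)}{(z_\epsilon - z_{0,\epsilon})^2 - \epsilon h_{1,\epsilon}(z_\epsilon)}.$$

Since $|(z - z_{0,\epsilon})^2 - \epsilon h_{1,\epsilon}(z)| > \epsilon r'$ on $\mathcal{D}_c(z_{0,\epsilon}, \sqrt{\epsilon r'})$ (see (29)), we get that

$$|z_\epsilon - z_{0,\epsilon}| < \frac{\epsilon^2 \mathcal{K}_2}{\epsilon r'} = O(\epsilon).$$

But from equation (26), we also have $\epsilon(z_\epsilon - z_{0,\epsilon}) h_{1,\epsilon}(z_\epsilon) = (z_\epsilon - z_{0,\epsilon})^3 + \epsilon^2 h_{2,\epsilon}(z_\epsilon)$ which leads to

$$z_\epsilon - z_{0,\epsilon} = \epsilon \frac{h_{2,\epsilon}(z_\epsilon)}{h_{1,\epsilon}(z_\epsilon)} + \frac{(z_\epsilon - z_{0,\epsilon})^3}{\epsilon h_{1,\epsilon}(z_\epsilon)}.$$

It is clear that

$$\frac{h_{2,\epsilon}(z_\epsilon)}{h_{1,\epsilon}(z_\epsilon)} - \frac{h_2(z_0)}{h_1(z_0)} = o(1),$$

so that

$$z_\epsilon - z_{0,\epsilon} = \epsilon \frac{h_2(z_0)}{h_1(z_0)} + o(\epsilon).$$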
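The two zero counts obtained through Rouché's theorem can be illustrated numerically with the argument principle. The sketch below assumes constant $h_{1,\epsilon} \equiv h_1$ and $h_{2,\epsilon} \equiv h_2$, $z_{0,\epsilon} = z_0 = 0$, and hypothetical values of $\epsilon$, $r$ and $r' < |h_1|/4$; the helper `count_zeros` is an illustrative name. It counts the zeros of the cubic inside a circle by accumulating the phase increments of the function along that circle.

```python
import cmath
import math

def count_zeros(func, center, radius, n=4000):
    """Zeros of an analytic func inside a circle, via its winding number on the circle."""
    total = 0.0
    prev = func(center + radius)
    for k in range(1, n + 1):
        z = center + radius * cmath.exp(2j * math.pi * k / n)
        cur = func(z)
        total += cmath.phase(cur / prev)    # small phase increment between samples
        prev = cur
    return round(total / (2.0 * math.pi))

eps, h1, h2, z0 = 1e-4, 2.0, 3.0, 0.0       # illustrative values
f_eps = lambda z: (z - z0) ** 3 - eps * h1 * (z - z0) + eps ** 2 * h2

r, r_prime = 0.5, 0.4                       # r_prime < |h1| / 4
n_large = count_zeros(f_eps, z0, r)
n_small = count_zeros(f_eps, z0, math.sqrt(eps * r_prime))
print(n_large, n_small)
```

With these values one expects three zeros in the disk of radius $r$ and a single zero in the disk of radius $\sqrt{\epsilon r'}$, the latter being the solution at distance $O(\epsilon)$ from $z_{0,\epsilon}$.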

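We now evaluate the two remaining solutions of (26) located in the set $\mathcal{D}(z_0, r) \setminus \mathcal{D}(z_{0,\epsilon}, \sqrt{\epsilon r'})$, denoted $z_\epsilon^-, z_\epsilon^+$. As $|z_\epsilon^- - z_{0,\epsilon}| > \sqrt{r'\epsilon}$, we can write

$$(z_\epsilon^- - z_{0,\epsilon})^2 = \epsilon h_{1,\epsilon}(z_\epsilon^-) - \epsilon^2 \frac{h_{2,\epsilon}(z_\epsilon^-)}{z_\epsilon^- - z_{0,\epsilon}}. \qquad (30)$$

This implies that $|z_\epsilon^- - z_{0,\epsilon}| = O(\sqrt{\epsilon})$ and that $|z_\epsilon^- - z_0| = o(1)$. Taking a suitable branch of the square root, (30) implies that

$$z_\epsilon^- - z_{0,\epsilon} = -\sqrt{\epsilon h_{1,\epsilon}(z_\epsilon^-)} + o(\sqrt{\epsilon}) = -\sqrt{\epsilon h_1(z_0)} + o(\sqrt{\epsilon}).$$

We obtain similarly that $z_\epsilon^+ - z_{0,\epsilon} = \sqrt{\epsilon h_1(z_0)} + o(\sqrt{\epsilon})$.

We finally verify that if $z_0$ and $z_{0,\epsilon}$ belong to $\mathbb{R}$ for each $\epsilon$, and that $h_j(z)$ and $h_{j,\epsilon}(z)$ belong to $\mathbb{R}$ for each $\epsilon$ if $z \in \mathbb{R}$ for $j = 1, 2$, then $z_\epsilon$ is real while $z_\epsilon^-, z_\epsilon^+$ are real if $h_1(z_0) > 0$.

If $z_\epsilon$ is not real, it is clear that $z_\epsilon^*$ is also solution of (26) because functions $h_{i,\epsilon}$ verify $(h_{i,\epsilon}(z))^* = h_{i,\epsilon}(z^*)$. As $|z_\epsilon^* - z_{0,\epsilon}| = |z_\epsilon - z_{0,\epsilon}| = O(\epsilon)$, and as (26) has a unique solution in the disk $\mathcal{D}(z_{0,\epsilon}, \sqrt{\epsilon r'})$, this implies that $z_\epsilon^* = z_\epsilon$. On the other hand, assume that $h_1(z_0) > 0$ and that $z_\epsilon^-, z_\epsilon^+$ are non-real. Then, $(z_\epsilon^+)^*$ and $(z_\epsilon^-)^*$ are also solutions of (26). Since equation (26) has only two solutions outside the disk $\mathcal{D}(z_{0,\epsilon}, \sqrt{\epsilon r'})$, it follows that $z_\epsilon^+$ and $z_\epsilon^-$ are complex conjugate. But as their real parts have opposite sign for $\epsilon$ small enough, this leads to a contradiction. Therefore $z_\epsilon^+$ and $z_\epsilon^-$ are real. We finally note that if $h_1(z_0) < 0$, then $z_\epsilon^+$ and $z_\epsilon^-$ are non real.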